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CHARGE TAGS AND THE SEPARATION OF NUCLEIC ACID MOLECULES 

The present invention is a continuation-in-part of pending U.S. patent application 
No. 09/333,145, which is a continuation application of U.S. Patent No. 6,001,567, herein 
5 incorporated by reference in its entirety. 

FIELD OF THE INVENTION 

The present invention relates to novel phosphoramidites, including positive and 
neutrally charged compounds. The present invention also provides charge tags for 
10 attachment to materials including solid supports and nucleic acids, wherein the charge 
tags increase or decrease the net charge of the material. The present invention further 
provides methods for separating and characterizing molecules based on the charge 
differentials between modified and unmodified materials. 

1 5 BACKGROUND OF THE INVENTION 

Methods for the detection and characterization of specific nucleic acid sequences 
and sequence variations have been used to detect the presence of viral or bacterial nucleic 
acid sequences indicative of an infection and to detect the presence of variants or alleles 
of genes associated with disease and cancers. These methods also find application in the 

20 identification of sources of nucleic acids, as for forensic analysis or for paternity 

determinations. Various methods are known to the art that may be used to detect and 
characterize specific nucleic acid sequences and sequence variants. Nonetheless, with the 
completion of the nucleic acid sequencing of the human genome, as well as the genomes 
of numerous other organisms such as pathogenic organisms, the demand for fast, reliable, 

25 cost-effective and user-friendly tests for the detection of specific nucleic acid sequences 
continues to grow. Importantly, these tests must be able to create a detectable signal 
from samples that contain very few copies of the sequence of interest. 

There are a number of techniques that have been developed for characterizing 
specific nucleic acid sequences. Examples of detection techniques include the "TaqMan" 

30 or nick-translation PCR assay described in U.S. Patent No. 5,21 0,01 5 to Geifand et al 

(the disclosure of which is herein incorporated by reference), the assays described in U.S. 
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Patent Nos. 4,775,619 and 5,1 18,605 to Urdea (the disclosures of which are herein 
incorporated by reference), the catalytic hybridization amplification assay described in 
U.S. Patent No. 5,403,71 1 to Walder and Walder (the disclosure of which is herein 
incorporated by reference), the cycling probe assay described in U.S. Patents Nos, 
5 4,876,1 87 and 5,01 1 ,769 to Duck et aL, the target-catalyzed oligonucleotide modification 
assay described in U.S. patents 6,1 10,677 and 6,121,001 to Western et al. (the disclosures 
of which are herein incorporated by reference), the SNP detection methods of Orchid 
Bioscience in U.S. patent 5,952,174 (the disclosure of which is herein incorporated by 
reference), the methods of U.S. patent 5,882,867 to UHman et al. (the disclosure of 

1 0 which is herein incorporated by reference) the polymerase chain reaction (PCR) 

described in U.S. Patent Nos. 4,683,195, 4,683,202, and 4,965,188 to Mullis and Mullis 
et al. (the disclosures of which are herein incorporated by reference) and the ligase chain 
reaction (LCR) described in U.S. Patent Nos. 5,427,930 and 5,494,810 to Birkenmeyer et 
aL and Barany et aL (the disclosures of which are herein incorporated by reference). The 

15 above examples are intended to be illustrative of nucleic acid-based detection assays and 
do not provide an exhaustive list. Bach of these techniques requires a detection step for 
detecting a reaction product that is indicative of a desired target nucleic acid (e.g., 
detection of cleavage products, extension products, etc.). While a number of advances 
have been made in the assay methods and detection instrumentation to improve the 
20 sensitively, speed, and cost of detection methods the art is still in need of further 

improved methods, compositions, and systems to make the assays more sensitive and 
efficient. 

SUMMARY OF THE INVENTION 

25 The present invention relates to novel phosphoramidites, including positive and 

neutrally charged compounds. The present invention also provides charge tags for 
attachment to materials including solid supports and nucleic acids, wherein the charge 
tags increase or decrease the net charge of the material. The present invention further 
provides methods for separating and characterizing molecules based on the charge 

30 differentials between modified and unmodified materials. 
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For example, the present invention provides a composition comprising a charge 
tag attached to a nucleic acid molecule (e.g., to a terminal end of a nucleic acid 
molecule). In some embodiments, the charge tag comprises a phosphate group and a 
positively charged moiety. In some preferred embodiments, the charge tag further 
5 comprises a dye. The present invention is not limited by the position of the individual 
modular components of the charge tag. For example, in some embodiments, the dye is 
positioned between the nucleic acid and the positively charged moiety, while in other 
embodiments, the positively charged moiety is positioned between the nucleic acid and 
the dye. The present invention is also not limited by the number of each type of 
10 component in the charge tag (e.g., the number of dyes, positively charged moieties, etc.). 
For example, in some embodiments, the charge tag comprises first and second positively 
charged moieties. 

In some embodiments of the present invention, the charge tag has a net positive 
charge. For example, in some embodiments, the charge tag has a net positive charge of 1 , 

15 2, 3, etc. In some embodiments, the charge tag possesses a positive charge only under 
certain reaction conditions (e.g., pH 6-10). 

In some embodiments, the charge tag further comprises one or more nucleotides. 
In some embodiments, the nucleic acid molecule to which the charge tag is attached 
contains a sequence that is complementary to a target nucleic acid. In some such 

20 embodiments, the one or more nucleotides in the charge are not complementary to the 

target nucleic acid. In other such embodiments, the nucleic acid comprises a first portion 
complementary to a target nucleic acid and a second portion that is not complementary to 
said target nucleic acid, wherein the charge tag is attached to the second portion of the 
nucleic acid (e.g., to a terminal end of the nucleic acid that is located in the second 

25 portion). 

In some embodiments of the present invention, the nucleic acid and the charge tag 
have a combined net neutral charge, wherein the charge tag, in isolation, has a net 
positive charge. In other embodiments, the nucleic acid and the charge tag have a 
combined net negative charge, wherein the charge tag has a net positive charge. 
30 The present invention is not limited by the nature of the positively charged moiety 

of the charge tag. Positively charged moieties include, but are not limited to primary 
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15 



20 



25 



amines, secondary amines, tertiary amines, ammonium groups, positively charged metal 
groups (e.g., caged ions attached to the charge tag through a linking group), and the like. 

In some embodiments, the charge tag further comprises a positively charged 
phosphoramidite or a neutral phosphoramidite. The present invention is not limited by 
the nature of the positively charged phosphoramidite or the neutral phosphoramidite. For 
example, in some embodiments, the charge tags comprise a novel phosphoramidite of the 
present invention. 

For example, the present invention provides a composition comprising a 
positively charged phosphoramidite. In some embodiments, the positively charged 
phosphoramidite contains one or more positively charged moieties including, but not 
limited to, primary amine groups, secondary amine groups, tertiary amine groups, 
ammonium groups, charged metal ions, and the like. In some embodiments, the 
phosphoramidite has a net positive charge of one. In some particularly preferred 
embodiments, the phosphoramidite has the structure: 



wherein, X is a reactive phosphate group (e.g., PO4) and Y is a protecting group (e.g., 
dimethoxy trityl [DMT]) and/or a protected group (e.g., DMT-protected hydroxyl group). 

The present invention further provides a composition comprising a nucleic acid 
molecule containing a positively or neutrally charged phosphoramidite. The present 
invention also provides a composition comprising a charge tag attached to a terminal end 
of a nucleic acid molecule, wherein the charge tag comprises a positively charged or 
neutrally charged phosphoramidite. In some preferred embodiments, the positively 
charged phosphoramite comprises an amine group, wherein the amine group is not 
further attached to another molecule (a molecule other than the phosphoramidite). 

The present invention further provides a composition comprising a neutrally 
charged phosphoramidite. In some preferred embodiments, the neutrally charged 
phosphoramidite comprises a nitrogen-containing chemical group selected from the 
group comprising primary amine, secondary amine, tertiary amine, ammonium group, 
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and charged metal ion. In some embodiments, the composition further comprises a 
nucleic acid molecule attached to the neutrally charged phosphoramidite. In some 
preferred embodiments, the nucleic acid molecule is attached to a charge tag comprising 



order, other components. For example, the charge tag may further comprise a positively 
charged phosphoramidite. In some embodiments of the present invention, the charge tag 
containing the neutrally charged phosphoramidite has a net positive charge. In some 
particularly preferred embodiments of the present invention, the neutrally charged 
phosphoramidite has the structure: 



wherein X is a protecting group {e.g., dimethoxy trityl group [DMT]) and/or a protected 
group {e.g., DMT-protected hydroxyl group), 2 is a reactive phosphate, and N comprises 
an amine group. In some preferred embodiments, the N group is N-(CH 2 ) n CH3, wherein 
n is 0 or a positive integer from I to 12. 

The present invention also provides a composition comprising a solid support 
attached to a charge tag. For example, in some embodiments, the charge tag comprises a 
positively charged moiety and a reactive group configured to allow the charge tag to 
covalently attach to a nucleic acid molecule. Any of the charge tags described herein, 
may be attached to the solid support. 

The present invention further provides a composition comprising a fluorescent 
dye directly bonded to a phosphate group, wherein the phosphate group is directly 
bonded to an amine group. In some embodiments, the composition comprises a charge 
tag, wherein the fluorescent dye is contained within the charge tag. The present invention 
is not limited by the nature of the fluorescent dye. However, in some preferred 
embodiments, the fluorescent dye comprises a Cy dye {e.g., Cy3). 

The present invention also provides a mixture comprising a plurality of 
oligonucleotides attached to charge tags. In some embodiments, each oligonucleotide is 
attached to a different charge tag. In other embodiments, two or more different 
oligonucleotides have the same type of charge tag. In some preferred embodiments, each 



the neutrally charged phosphoramidite. The charge tag may further comprise, in any 
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of the charge tags comprises a phosphate group and a positively charged moiety. While 
not limited by the number of oligonucleotides attached to different charge tags, in some 
embodiments, the plurality of oligonucleotides comprises four or more oligonucleotides 
(e.g., 5, 6, 7, . . 10, . . 50, . . 100, . . .), each attached to a different charge tag. Any 
5 of the charge tags described herein are contemplated for use in the mixtures. 

The present invention further provides a method of separating nucleic acid 
molecules, comprising the steps of: a) treating a charge-balanced oligonucleotide 
containing a charge tag under conditions such that a charge-unbalanced oligonucleotide 
containing the charge tag is produced, wherein the charge-unbalanced oligonucleotide is 

10 contained in a reaction mixture; and b) separating the charge-unbalanced oligonucleotide 
from the reaction mixture. While the present invention is not limited by the means by 
which a charge-unbalanced oligonucleotide is generated, in some preferred embodiments, 
the oligonucleotides are treated with a reactant (e.g. , a nuclease). Any of the charge tags 
described herein are contemplated for use in the method. While the present invention is 

1 5 not limited by the nature of the separation step, contemplated separation steps include, 
but are not limited to, gel electrophoretic separation, capillary electrophoretic separation, 
capillary zone electrophoretic separation, and separation is a microchannel. 

The present invention also provides a method of separating nucleic acid 
molecules, comprising the steps of: a) treating a plurality of charge-balanced 

20 oligonucleotides, each containing different charge tags, under conditions such that two or 
more charge-unbalanced oligonucleotides containing the charge tags are produced, 
wherein the charge-unbalanced oligonucleotides are contained in a reaction mixture; and 
b) separating the charge-unbalanced oligonucleotides from the reaction mixture. In some 
preferred embodiments, the separating comprises separating the charge-unbalanced 

25 oligonucleotides such that charge-unbalanced oligonucleotides containing different 
charge tags are separated from one another. Any of the charge tag, oligonucleotide 
mixtures, and separation methods described herein may be used with this method. 

DEFINITIONS 

30 To facilitate an understanding of the present invention, a number of terms and 

phrases are defined below: 
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The term "charge-balanced" molecule or oligonucleotide refers to a molecule or 
oligonucleotide (the input oligonucleotide in a reaction) that has been modified such that 
the modified molecule or oligonucleotide bears a charge, such that when the modified 
molecule or oligonucleotide is either reduced in size (e.g., cleaved, shortened, 
5 disassociated, unbound, or otherwise altered such that it is part of a complex or molecule 
having a lower aggregate molecular weight) or increased in sized (e.g., enlarged, 
elongated, associated, bound, or otherwise altered such that it is part of a complex or 
molecule having a higher aggregate molecular weight), a resulting product bears a net 
charge or charge to mass ratio different from the input molecule or oligonucleotide (the 

10 resulting molecule thus being a "charge- unbalanced" molecule or oligonucleotide) 

thereby permitting separation of the input and reacted molecules or oligonucleotides on 
the basis of charge. The term "charge-balanced" does not imply that the modified or 
balanced molecule or oligonucleotide has a net neutral charge (although this can be the 
case). Charge-balancing refers to the design and modification of a molecule or 

15 oligonucleotide such that a specific reaction product generated from this input molecule 
or oligonucleotide can be separated on the basis of charge from the input molecule or 
oligonucleotide. 

For example, in an INVADER oligonucleotide-directed cleavage assay in which 
the probe oligonucleotide bears the sequence: 5* TTCTTTTCACCAGCGAGACGGG 3' 

20 (i.e. , SEQ ID NO: 1 without the modified bases) and cleavage of the probe occurs 
between the second and third residues, one possible charge-balanced version of this 
oligonucleotide would be: 5' Cy3-AminoT-Amino-TCTTTTCACCAGCGAGAC GGG 
3' (SEQ ID NO:l). This modified oligonucleotide bears a net negative charge. After 
cleavage, the following oligonucleotides are generated: 5' Cy3-AminoT-Amino-T 3* and 

25 5' CTTTTCACCAGCGAGACGGG 3' (residues 3-22 of SEQ ID NO:l). 5' Cy3- 

AminoT-Amino-T 3' bears a detectable moiety (the positively charged Cy3 dye) and two 
amino-modified bases. The amino-modified bases and the Cy3 dye contribute positive 
charges in excess of the negative charges contributed by the phosphate groups and thus 
the 5' Cy3-AminoT-Amino-T 3 f oligonucleotide has a net positive charge. The other, 

30 longer cleavage fragment, like the input probe, bears a net negative charge. Because the 
5' Cy3-AminoT-Amino-T 3' fragment is separable on the basis of charge from the input 
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probe (the charge-balanced oligonucleotide), it is referred to as a charge-unbalanced 
oligonucleotide. The longer cleavage products are not generally separated on the basis of 
charge from the input oligonucleotide as both oligonucleotides bear a net negative 
charge. 

^ 5 The term "net neutral charge" when used in reference to a molecule or 

oligonucleotide, including modified oligonucleotides, indicates that the sum of the 
§ charges present (e.g., R-NH3+ groups on thymidines, the N3 nitrogen of cytosine, 

$ presence or absence or phosphate groups, etc.) under the desired reaction or separation 

conditions is essentially zero. A molecule or oligonucleotide having a net neutral charge 
1 0 would not migrate in an electrical field. 

The term "net positive charge" when used in reference to a molecule or 
oligonucleotide, including modified oligonucleotides, indicates that the sum of the 
charges present (e.g., R-NH3+ groups on thymidines, the N3 nitrogen of cytosine, 
presence or absence or phosphate groups, etc.) under the desired reaction conditions is +1 
1 5 or greater. A molecule or oligonucleotide having a net positive charge would migrate 
; toward the negative electrode in an electrical field. 

^ The term "net negative charge" when used in reference to a molecule or 

oligonucleotide, including modified oligonucleotides, indicates that the sum of the 
charges present (e.g., R-NH3+ groups on thymidines, the N3 nitrogen of cytosine, 
20 presence or absence or phosphate groups, etc.) under the desired reaction conditions is -1 
or lower. A molecule or oligonucleotide having a net negative charge would migrate 
toward the positive electrode in an electrical field. 

As used herein, the terms "complementary" or "complementarity" are used in 
reference to polynucleotides (/.£., a sequence of nucleotides such as an oligonucleotide or 
25 a target nucleic acid) related by the base-pairing rules. For example, for the sequence " 
5'-A-G-T-3'," is complementary to the sequence " 3'-T-C-A-5'. H Complementarity may be 
"partial," in which only some of the nucleic acids* bases are matched according to the 
base pairing rules. Or, there may be "complete" or "total" complementarity between the 
nucleic acids. The degree of complementarity between nucleic acid strands has 
30 significant effects on the efficiency and strength of hybridization between nucleic acid 
strands. This is of particular importance in amplification reactions, as well as detection 
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methods which depend upon binding between nucleic acids. Either term may also be 
used in reference to individual nucleotides, especially within the context of 
polynucleotides. For example, a particular nucleotide within an oligonucleotide may be 
noted for its complementarity, or lack thereof, to a nucleotide within another nucleic acid 
5 strand, in contrast or comparison to the complementarity between the rest of the 
oligonucleotide and the nucleic acid strand. 

The term "homology" and "homologous" refers to a degree of identity. There 
may be partial homology or complete homology. A partially homologous sequence is 
one that is less than 100% identical to another sequence. 

10 As used herein, the term "hybridization" is used in reference to the pairing of 

complementary nucleic acids. Hybridization and the strength of hybridization the 
strength of the association between the nucleic acids) is influenced by such factors as the 
degree of complementary between the nucleic acids, stringency of the conditions 
involved, and the T,„ of the formed hybrid. "Hybridization" methods involve the 

15 annealing of one nucleic acid to another, complementary nucleic acid, i.e., a nucleic acid 
having a complementary nucleotide sequence. The ability of two polymers of nucleic 
acid containing complementary sequences to find each other and anneal through base 
pairing interaction is a well -recognized phenomenon. The initial observations of the 
"hybridization" process by Marmur and Lane, Proc. Natl. Acad, Sci. USA 46:453 (1960) 

20 and Doty et aL, Proc. Natl. Acad. Sci. USA 46:46 1 (1 960) have been followed by the 
refinement of this process into an essential tool of modern biology. 

With regard to complementarity, it is important for some diagnostic applications 
to determine whether the hybridization represents complete or partial complementarity. 
For example, where it is desired to detect simply the presence or absence of pathogen 

25 DNA (such as from a virus, bacterium, fungi, mycoplasma, protozoan) it is only 
important that the hybridization method ensures hybridization when the relevant 
sequence is present; conditions can be selected where both partially complementary 
probes and completely complementary probes will hybridize. Other diagnostic 
applications, however, may require that the hybridization method distinguish between 

30 partial and complete complementarity. It may be of interest to detect genetic 
polymorphisms. For example, human hemoglobin is composed, in part, of four 
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polypeptide chains. Two of these chains are identical chains of 141 amino acids (alpha 
chains) and two of these chains are identical chains of 146 amino acids (beta chains). 
The gene encoding the beta chain is known to exhibit polymorphism. The normal allele 
encodes a beta chain having glutamic acid at the sixth position. The mutant allele 
5 encodes a beta chain having valine at the sixth position. This difference in amino acids 
has a profound (most profound when the individual is homozygous for the mutant allele) 
physiological impact known clinically as sickle cell anemia. It is well known that the 
genetic basis of the amino acid change involves a single base difference between the 
normal allele DNA sequence and the mutant allele DNA sequence. 

10 The complement of a nucleic acid sequence as used herein refers to an 

oligonucleotide which, when aligned with the nucleic acid sequence such that the 5' end 
of one sequence is paired with the 3 1 end of the other, is in "antiparallel association." 
Certain bases not commonly found in natural nucleic acids may be included in the nucleic 
acids of the present invention and include, for example, inosine and 7-deazaguanine. 

1 5 Complementarity need not be perfect; stable duplexes may contain mismatched base 
pairs or unmatched bases. Those skilled in the art of nucleic acid technology can 
determine duplex stability empirically considering a number of variables including, for 
example, the length of the oligonucleotide, base composition and sequence of the 
oligonucleotide, ionic strength and incidence of mismatched base pairs. 

20 As used herein, the term "T m M is used in reference to the "melting temperature." 

The melting temperature is the temperature at which a population of double-stranded 
nucleic acid molecules becomes half dissociated into single strands. Several equations 
for calculating the T m of nucleic acids are well known in the art. As indicated by 
standard references, a simple estimate of the T m value may be calculated by the equation: 

25 T m = 81 .5 + 0.4 1(% G + C), when a nucleic acid is in aqueous solution at 1 M NaCl (see 
e.g., Anderson and Young, Quantitative Filter Hybridization, in Nucleic Acid 
Hybridization (1985). Other references (e.g., Allawi, H.T. & SantaLucia, J., Jr. 
Thermodynamics and NMR of internal G.T mismatches in DNA. Biochemistry 36, 
1 0581-94 (1997) include more sophisticated computations which take structural and 

30 environmental, as well as sequence characteristics into account for the calculation of T tn . 
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As used herein the term "stringency" is used in reference to the conditions of 
temperature, ionic strength, and the presence of other compounds, under which nucleic 
acid hybridizations are conducted. With "high stringency" conditions, nucleic acid base 
pairing will occur only between nucleic acid fragments that have a high frequency of 
5 complementary base sequences. Thus, conditions of "weak" or "low" stringency are 
often required when it is desired that nucleic acids which are not completely 
complementary to one another be hybridized or annealed together. 

The term "oligonucleotide" as used herein is defined as a molecule comprising 
two or more deoxyribonucleotides or ribonucleotides, preferably at least 5 nucleotides, 

1 0 more preferably at least about 10-15 nucleotides and more preferably at least about 15 to 
30 nucleotides. The exact size will depend on many factors, which in turn depend on the 
ultimate function or use of the oligonucleotide. The oligonucleotide may be generated in 
any manner, including chemical synthesis, DNA replication, reverse transcription, PCR, 
or a combination thereof. 

1 5 Because mononucleotides are reacted to make oligonucleotides in a manner such 

that the 5' phosphate of one mononucleotide pentose ring is attached to the 3' oxygen of 
its neighbor in one direction via a phosphodiester linkage, an end of an oligonucleotide is 
referred to as the "5' end" if its 5' phosphate is not linked to the 3* oxygen of a 
mononucleotide pentose ring and as the "3* end" if its 3* oxygen is not linked to a 5' 

20 phosphate of a subsequent mononucleotide pentose ring. As used herein, a nucleic acid 
sequence, even if internal to a larger oligonucleotide, also may be said to have 5' and 3' 
ends. A first region along a nucleic acid strand is said to be upstream of another region if 
the 3' end of the first region is before the 5' end of the second region when moving along 
a strand of nucleic acid in a 5' to 3' direction. 

25 The term "label" as used herein refers to any atom or molecule that can be used to 

provide a detectable (preferably quantifiable) signal, and that can be attached to a nucleic 
acid or protein. Labels may provide signals detectable by fluorescence, radioactivity, 
colorimetry, gravimetry, X-ray diffraction or absorption, magnetism, enzymatic activity, 
and the like. A label may be a charged moiety (positive or negative charge) or 

30 alternatively, may be charge neutral. Labels can include or consist of nucleic acid or 
protein sequence, so long as the sequence comprising the label is detectable. 
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The term "sample" in the present specification and claims is used in its broadest 
sense. On the one hand it is meant to include a specimen or culture (e.g., microbiological 
cultures). On the other hand, it is meant to include both biological and environmental 
samples. A sample may include a specimen of synthetic origin. 
5 Biological samples may be animal, including human, fluid, solid (e.g., stool) or 

tissue, as well as liquid and solid food and feed products and ingredients such as dairy 
items, vegetables, meat and meat by-products, and waste. Biological samples may be 
obtained from all of the various families of domestic animals, as well as feral or wild 
animals, including, but not limited to, such animals as ungulates, bear, fish, lagamorphs, 
1 0 rodents, etc. 

Environmental samples include environmental material such as surface matter, 
soil, water and industrial samples, as well as samples obtained from food and dairy 
processing instruments, apparatus, equipment, utensils, disposable and non-disposable 
items. These examples are not to be construed as limiting the sample types applicable to 

1 5 the present invention. 

The term "source of target nucleic acid" refers to any sample that contains nucleic 
acids (RNA or DNA). Particularly preferred sources of target nucleic acids are biological 
samples including, but not limited to cultures, blood, saliva, cerebral spinal fluid, pleural 
fluid, milk, lymph, sputum, semen, and animal or plant tissue. 

20 As used herein, the term "charge tag" refers to a modular chemical complex that 

is attached to or to be attached to another molecule, wherein the charge tag has a net 
charge that differs from the net charge of the other molecule. For example, charge tags 
may be attached to nucleic acid molecules (e.g., to the terminal end of a nucleic acid 
molecule). Charge tags contain any number of desired components including, but not 

25 limited to, dyes, linker groups, nucleotides, phosphoramidites, phosphonates, phosphate 
groups, amine groups, fluorescent quencher groups and the like. 

In a "mixture comprising a plurality of oligonucleotides with each oligonucleotide 
attached to a different charge tag," two or more oligonucleotides each possess a distinct 
charge tag, wherein the chemical makeup of the charge tags differ from one another. A 

30 mixture of oligonucleotides, each with a different charge tag, may also comprise 
additional oligonucleotides. For example, the mixture may contain a first set of 
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oligonucleotides, each with identical first charge tags and a second set of 
oligonucleotides, each with an identical second charge tags. 

As used herein, the term "positively charged moiety" refers to a chemical group 
or molecule that contains a net positive charge. Positively charged moieties may be 
5 attached to or associated with other molecules or materials. A composition containing a 
positively charged moiety may itself have a net positive charge (because of the positively 
charged moiety or otherwise), but need not. In some embodiments of the present 
invention, positively charged moieties include, but are not limited to, amines (e.g., 
primary, secondary, and tertiary amines). For example, in some embodiments of the 

1 0 present invention, phosphoramidites contain a positively charged moiety comprising an 
amine. Amine groups are often used as linking chemistries for attaching to or more 
molecules (e.g., attaching a phosphoramidite to another molecule). However, in some 
embodiments of the present invention, amine groups are not used as linking groups, but 
are provided to give a molecule a positive charge. Thus, in some embodiments, the 

1 5 amines are attached to a molecule of interest (e.g., a phosphoramidite), but are not further 
attached to another molecule (e.g., are not attached to a molecule other than the 
phosphoramidite). 

As used herein, the term "dye" refers to a molecule, compound, or substance that 
can provide an optically detectable signal (e.g., fluorescent, luminescent, colorimetric, 

20 etc). For example, dyes include fluorescent molecules that can be associated with nucleic 
acid molecules (e.g., Cy3). 

As used herein, the term "protecting group" refers to a molecule or chemical 
group that is covalentiy attached to a compound to prevent chemical modification of the 
compound or modification of specific chemical groups of the compound. For example, 

25 protecting groups may be attached to a reactive group of a compound to prevent the 
reactive group from participating in chemical reactions including, for example, 
intramolecular reactions. In some cases, a protecting group may act as a leaving group, 
such that when the molecule is added to another compound in a desired synthesis 
reaction, the protecting group is lost, allowing a reactive group to participate in covalent 

30 bonding to the compound. The phosphoramidites of the present invention typically 

contain one or more protective groups prior to their addition to nucleic acid molecules. 
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For example, the reactive phosphate of the phosphoramidite (i.e., the phosphate group 
that is covalently attached to another molecule when the phosphoramidite is added to the 
other molecule) may contain one or more protecting groups. A detailed description of 
phosphoramidites and their addition to nucleic acid molecules is provided Beaucage and 
5 Iyer (Tetrahedron 49: 1925 [1993]), herein incorporated by reference in its entirety. 

As used herein, the terms "solid support" or "support" refer to any material that 
provides a solid or semi-solid structure with which another material can be attached. 
Such materials include smooth supports (e.g., metal, glass, plastic, silicon, and ceramic 
surfaces) as well as textured and porous materials. Such materials also include, but are 

10 not limited to, gels, rubbers, polymers, and other non-rigid materials. Solid supports need 
not be flat. Supports include any type of shape including spherical shapes (e.g., beads). 
Materials attached to solid support may be attached to any portion of the solid support 
(e.g., may be attached to an interior portion of a porous solid support material). Preferred 
embodiments of the present invention have biological molecules such as nucleic acid 

1 5 molecules, charge tags, and proteins attached to solid supports. A biological material is 
"attached'* to a solid support when it is associated with the solid support through a non- 
random chemical or physical interaction. In some preferred embodiments, the attachment 
is through a covalent bond. However, attachments need not be covalent or permanent. In 
some embodiments, materials are attached to a solid support through a "spacer molecule" 

20 or "linking group." Such spacer molecules are molecules that have a first portion that 

attaches to the biological material and a second portion that attaches to the solid support. 
Thus, when attached to the solid support, the spacer molecule separates the solid support 
and the biological materials, but is attached to both. 

As used herein, the term "directly bonded," in reference to two molecules refers 

25 to covalent bonding between the two molecules without any intervening linking group or 
spacer groups that are not part of parent molecules. 

As used herein, the terms "linking group" and "linker group" refer to an atom or 
molecule that links or bonds two entities (e.g., solid supports, oligonucleotides, or other 
molecules), but that is not a part of either of the individual linked entities. 

30 As used herein, the term "reactant," when referring to an agent that is used to 

generate charge-unbalanced molecules from charge-balanced molecules, refers to any 
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agent (e.g., enzyme, chemical, physical device, etc.) that can alter a charge-balanced 
molecule such that a charge-unbalanced molecule is created. 

As used herein, the methods of "capillary electrophoresis," "capillary zone 
electrophoresis," and "microti uids" refer to methods for use in the separation methods of 
5 the present invention. The methods of capillary electrophoresis, capillary zone 

electrophoresis, and microfluids are described in texts and journals including, but not 
limited to, Baker (1995) Capillary Electrophoresis, Wiley-lnterscience, New York, New 
■ York, Weinberger (2000) Capillary Electrophoresis, Second Edition, Academic Press, 

San Deigo, California, Atamna et al., J. Liq. Chromatogr., 13:2517 (1990), Nishi et ah, 

1 0 Anal. Chem., 61 :2434 (1 989), Terabe et ah, Anal, Chem., 56: 111 (1 984), Bousse et al., 
Annu. Rev. Biophys. Biomol, Struct., 29:155 (2000), and U.S. Pat. Nos. 5,916,426, 
5,807,682, 5,703,222, 5,470,705, 5,777,096, and 5,514,543, each of which is herein 
incorporated by reference in its entirety. 

As used herein, the term "kit" refers to any delivery system for delivering 

1 5 materials. In the context of reaction assays, such delivery systems include systems that 
allow for the storage, transport, or delivery of reaction reagents (e.g., oligonucleotides, 
enzymes, etc. in the appropriate containers) and/or supporting materials (e.g., buffers, 
written instructions for performing the assay etc.) from one location to another. For 
example, kits include one or more enclosures (e.g., boxes) containing the relevant 

20 reaction reagents and/or supporting materials. As used herein, the term "fragmented kit" 
refers to a delivery systems comprising two or more separate containers that each contain 
a subportion of the total kit components. The containers may be delivered to the intended 
recipient together or separately. For example, a first container may contain an enzyme 
for use in an assay, while a second container contains oligonucleotides. The term 

25 "fragmented kit" is intended to encompass kits containing Analyte specific reagents 

(ASR's) regulated under section 520(e) of the Federal Food, Drug, and Cosmetic Act, but 
are not limited thereto. Indeed, any delivery system comprising two or more separate 
containers that each contain a subportion of the total kit components are included in the 
term "fragmented kit." In contrast, a "combined kit" refers to a delivery system 

30 containing all of the components of a reaction assay in a single container (e.g., in a single 
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box housing each of the desired components). The term "kit" includes both fragmented 
and combined kits. 

DESCRIPTION OF THE DRAWINGS 

5 Fig. 1 shows the chemical structure of several positively charged heterodimeric 

DNA-binding dyes. 

Fig. 2 is the image generated by a fluorescence imager showing thermal 
degradation of oligonucleotides containing or lacking a 3' phosphate group. 

Fig. 3 depicts the structure of amino-modified oligonucleotides 70 and 74. 
10 Fig. 4a depicts the structure of amino-modified oligonucleotide 75 

Fig. 4b depicts the structure of amino-modified oligonucleotide 76. 
Fig. 5 diagrams the steps leading to the formation of a reactive H-phosphonate 
intermediate. The wavy lines shown linking the various constituents of these 
compositions in this and other drawings represent any organic group that can serve this 
15 linking purpose. 

Fig. 6 diagrams the conversion step leading to the synthesis of V and VI 
compounds. 

Fig. 7 illustrates the creation of an additional compound VII by altering the order 
of addition of the constituents (compared, e.g., with the order leading to the creation of 
20 compound VI, Fig. 6). 

Fig. 8 illustrates several possible modification configurations for a probe 
containing two points of modification. 

Fig. 9 diagrams the process of introducing a reporter group (e.g., a dye) into a 
synthesized compound using H-phosphonate chemistry. 
25 Fig. 10 diagrams the release of a positively-charged tag from an oligonucleotide 

by cleavage in an INVADER assay. 

Fig. 1 1 diagrams five different charge tags, shown as they would be attached to an 
oligonucleotide. 

Fig, 12 diagrams a chiral phosphoramidite. 
30 Fig. 13 diagrams the conversion of a phosphoramidite group to a phosphodiester 

linkage, as during oligonucleotide synthesis. 
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Fig. 14 diagrams the general structures of neutral (A) and positively charged (B) 
phosphoramidites. 

Fig. 15 illustrates several possible combinations in the synthesis of a charge 
balanced probe, using one each of dye, building block, neutral and positively charged 
5 phosphoramidites. 

Fig. 16 diagrams examples of synthesized neutral and positively charged 
phosphoramidites. 

Fig. 17 shows the structures of a group of charge balances oligonucleotide probes 
made using neutral and positively charged phosphoramidites. 
10 Fig. 1 8 is the image generated by a fluorescence imager scan of an IEF gel 

showing the migration of substrates 70, 70dp, 74, 74dp, 75, 75dp, 76 and 76dp. 

Fig. 1 9A provides a schematic showing an arrangement of a target-specific 
INVADER oligonucleotide (SEQ ID NO:2) and a target-specific probe oligonucleotide 
(SEQ ID NO: 1 1) bearing a 5* Cy3 label along a target nucleic acid (SEQ ID NO:49). 
15 Fig. 1 9B is the image generated by a fluorescence imager showing the detection 

of specific cleavage products generated in an invasive cleavage assay using charge 
reversal (/.e., charge based separation of cleavage products). 

Fig. 20 is the image generated by a fluorescence imager that depicts the sensitivity 
of detection of specific cleavage products generated in an invasive cleavage assay using 
20 charge reversal. 

Figs. 21 A and 2 IB are images generated by a fluorescence imager showing the 
products produced using the CLEAVASE A/G and Pfu FEN-1 nucleases and probes 
having or lacking a 5' positive charge; the gel shown in Fig. 21 A was run in the standard 
direction and the gel shown in Fig. 21B was run in the reverse direction. 
, 25 Fig. 22 shows a graph comparing rates of cleavage of charge-modified probes. 

Fig. 23 A shows a schematic diagram of an H-phosphonate (HP)-charge modified 
probe in an invasive cleavage. 

Fig. 23B diagrams the structures of the charge-modified nucleoside (dN) and 
hexanol (HEX) tags. 
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Fig. 24A is an image generated by a fluorescence imager showing the products of 
cleavage of 5 different charge-balanced probes, resolved by gel electrophoresis run in the 
standard direction. 

Fig. 24B is an image generated by a fluorescence imager showing the products of 
5 cleavage of 5 different charge-balanced probes, resolved by gel electrophoresis run in the 
reverse direction. 

Fig. 25 shows a graph comparing the rates of cleavage of five charge balanced 
probes and one fluorescein- labeled control probe. 

Fig. 26 A shows a graph comparing the rates of specific signal accumulation in 
10 reaction performed for different times, ranging from one to twenty four hours. 

Fig 26B shows a graph comparing the amounts of background signal detected in 
reactions performed for different times, ranging from one to twenty four hours. 

Fig. 27 is an image generated by a fluorescence imager showing the products of 
cleavage of four different charge-balanced probes, either alone or combined in a single 
15 lane, resolved by gel electrophoresis run in the reverse direction. 

Fig. 28A shows a schematic diagram of oligonucleotides used for the detection of 
human MCP-1 RNA in a cascading cleavage reaction releasing a charge tag for detection. 

Fig. 28B shows a schematic diagram of oligonucleotides used for the detection of 
human Ubiquitin RNA in a cascading cleavage reaction releasing a charge tag for 
20 detection. 

Fig. 29 is an image generated by a fluorescence imager showing the products of 
INVADER assays for the detection of human MCP-1 and ubiquitin mRNAs alone or 
combined in the same reaction. Products were resolved by gel electrophoresis run in the 
reverse direction, 

. 25. Fig. 30A shows images generated by a fluorescence imager, comparing the 

products of INVADER assays for the detection of human MCP-1 and ubiquitin RNAs 
either alone or combined in the same reaction, and resolved by gel electrophoresis run in 
either the reverse or normal polarity. 

Fig. 3 0B shows images generated by a fluorescence imager, comparing the 
30 products of INVADER assays for the detection of human MCP-1 and ubiquitin RNAs 
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either alone or combined in the same reaction, and resolved by gel electrophoresis run in 
either the reverse or normal polarity. 

Fig. 3 1 shows miceJIar electrokinetic chromatography (MECC) profiles showing 
the effects of sample buffer components on CE resolution. 
5 Fig. 32 shows MECC profiles showing the effects of injection time on CE 

resolution. 

Fig. 33 shows MECC profiles showing the effects of capillary type on CE 
resolution. 

Fig. 34 shows MECC profiles showing the effects of ionic strength of the 
1 0 separation buffer on CE resolution. 

Fig. 35 shows MECC profiles showing the effects of the pH of the separation 
buffer on CE resolution. 

Fig. 36 shows MECC profiles showing the effects of the concentration of Bis-Tris 
borate buffer on CE resolution. 
15 Fig. 37 shows MECC profiles showing the effects of the detergent of the 

efficiency of CE resolution. 

Fig. 38 shows MECC profiles for the four net positively charged tags, 5*-V-Cy3- 
C-3', 5'-V-(dA)-Cy3-C-3*, 5'-V-(dG)-Cy3-C-3\ and 5'-V-(dT)-Cy3-C-3', separated 
individually and as an equimolar mixture of all four molecules. 
20 Fig. 39 shows MECC profiles demonstrating the effect of the use of a fresh 

capillary on the separation of the tag mixture shown in Fig. 38. 

Fig, 40 shows MECC profiles for each of six net positively charged tags separated 
individually or as an equimolar mixture of all six molecules. 

Fig, 41 shows images generated by a fluorescence imager comparing the mobility 
25 of 5'-Tagl-G-3' or 5'-Tag2-G-3' under the conditions of a denaturing gel (A) to the 
mobility under conditions of a native gel (B). 

DESCRIPTION OF THE INVENTION 

As described above, some nucleic acid-based detection assays involve the 
30 elongation and/or shortening of oligonucleotide probes. For example, as described 
herein, the primer-directed, primer-independent, and INVADER-directed cleavage 
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assays, as well as the "nibbling" assay all involve the cleavage (i.e., shortening) of 
oligonucleotides as a means for detecting the presence of a target nucleic sequence. 
Examples of other detection assays that involve the shortening of an oligonucleotide 
probe include the "TaqMan" or nick- translation PCR assay, the assays described in U.S. 
4 5 Patent Nos. 4,775,619 and 5,1 18,605 to Urdea, the catalytic hybridization amplification 

assay described in of Walder and Walder, the cycling probe assay of Duck et al, and the 
target-catalyzed oligonucleotide modification assay of Western. Examples of detection 
% assays that involve the elongation of an oligonucleotide probe (or primer) include the 

SNP detection methods of Orchid Bioscience in U.S. patent 5,952,1 74, the methods of 

10 U.S. patent 5,882,867 to Ullman et al., the polymerase chain reaction (PCR), and the 
ligase chain reaction (LCR). The above examples are intended to be illustrative of 
nucleic acid-based detection assays that involve the elongation and/or shortening of 
oligonucleotide probes and do not provide an exhaustive list. 

Typically, nucleic acid-based detection assays that involve the elongation and/or 

1 5 shortening of oligonucleotide probes require post-reaction analysis to detect the products 
^ of the reaction. It is common that the specific reaction product(s) must be separated from 

the other reaction components, including the input or unreacted oligonucleotide probe. 
.One detection technique involves the electrophoretic separation of reacted and unreacted 
oligonucleotide probes. When the assay involves the cleavage or shortening of a probe, 

20 the unreacted product will be longer than the reacted or cleaved product. When the assay 
involves the elongation of a probe (or primer), the reaction products will be greater in 
length than the unreacted probes. Gel-based electrophoresis of a sample containing 
nucleic acid molecules of different lengths separates these fragments primarily on the 
basis of size. This is due to the fact that, in solutions having a neutral or alkaline pH, 

25 nucleic acids having widely different sizes (i.e., molecular weights) possess very similar 
charge-to-mass ratios and do not separate based solely on charge (Andrews, 
Electrophoresis, 2nd Edition, Oxford University Press (1986), pp. 153-154). The gel 
matrix acts as a molecular sieve and allows nucleic acids to be separated on the basis of 
size and shape (eg., linear, relaxed circular or covalently closed supercoiled circles). 

30 Unmodified nucleic acids have a net negative charge due to the presence of negatively 
charged phosphate groups contained within the sugar-phosphate backbone of the nucleic 
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acid. Typically, the sample is applied to gel near the negative pole and the nucleic acid 
fragments migrate into the gel toward the positive pole with the smallest fragments 
moving fastest through the gel. For gel electrophoresis to effectively resolve different 
fragments (i.e., to make them distinguishable from each other), the differences in size or 
5 shape must be great enough to cause perceptible differences in the rates of migration of 
the different fragments through the gel. 

The present invention provides novel compositions and methods for 
characterizing molecules, including nucleic acid molecules, based on differences in 
charge between starting molecules and molecules that have undergone a modification to 

10 add or remove one or more chemical constituents. For example, the present invention 
provides novel methods and compositions for modifying nucleic acid molecules wherein 
a cleaved or elongated nucleic acid molecule contains a different charge than unmodified 
nucleic acids, allowing for the efficient separation and detection of the reacted molecules. 
While the charge-based separation methods of the present invention are applicable to any 

1 5 number of systems (e.g., separation and characterization of products and intermediates in 
chemical synthesis and drug design) , and are not limited to the use of nucleic acids, the 
following description focuses on nucleic acid applications to illustrate certain preferred 
aspects of the present invention. 

The detailed description of the invention is presented in the following sections: 

20 L Fractionation Of Specific Nucleic Acids By Selective Charge Reversal 
a. Applications in INVADER assay cleavage reactions 
II. Positively Charged Moieties in the Synthesis of Charge-Balanced Molecules 

a. H-phosphonate Chemistry 

b, A New Class of Phosphoramidite Building Blocks 

.25 

I. Fractionation Of Specific Nucleic Acids By Selective Charge Reversal 

The present invention provides a novel means for fractionating nucleic acid, 
fragments on the basis of charge. This novel separation technique is related to the 
observation that positively charged adducts can affect the electrophoretic behavior of 
30 small oligonucleotides because the charge of the adduct is significant relative to charge of 
the whole complex. In addition to the use of positively charged adducts (e.g. , Cy3 and 
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Cy5 fluorescent dyes, the positively charged heterodimeric DNA-binding dyes shown in 
Fig. 1, etc.), the oligonucleotide may contain amino acids (particularly useful amino acids 
are the charged amino acids: lysine, arginine, asparate, glutamate), polypeptides, 
modified bases, such as amino-modified bases, charged ions or metals, a phosphonate 
5 backbone (at all or a subset of the positions), or any other chemical or molecular 
constituent that adds to the net positive charge of the oligonucleotide. In other 
embodiments, as discussed further below, a neutral dye or detection moiety (e.g. , biotin, 
streptavidin, etc.) may be employed in place of a positively charged adduct, in 
conjunction with the use of amino-modified bases and/or a complete or partial 

1 0 phosphonate backbone. 

This observed effect is of particular utility in assays based on the cleavage of 
DNA molecules. Using the INVADER assays described herein as an example, when an 
oligonucleotide is shortened through the action of a CLEAVASE enzyme or other 
cleavage agent, the positive charge can be made to not only significantly reduce the net 

1 5 negative charge, but to actually override it, effectively "flipping" the net charge of the 
labeled entity. This reversal of charge allows the products of target-specific cleavage to 
be partitioned from uncleaved probe by extremely simple means. For example, the 
products of cleavage can be made to migrate towards a negative electrode placed at any 
point in a reaction vessel, for focused detection without gel-based electrophoresis. When 

20 a slab gel is used, sample wells can be positioned in the center of the gel, so that the 
cleaved and uncleaved probes can be observed to migrate in opposite directions. 
Alternatively, a traditional vertical gel can be used, but with the electrodes reversed 
relative to usual DNA gels (i.e., the positive electrode at the top and the negative 
electrode at the bottom) so that the cleaved molecules enter the gel, while the uncleaved 

25 disperse into the upper reservoir of electrophoresis buffer. Similarly, the electrodes of a 
capillary or microchannel device can be configured so that positively charged cleaved 
molecules preferentially enter the capillary or channel for separation. 

An significant benefit of this type of readout is the absolute nature of the partition 
of products from substrates (i.e., the separation may be as high as 100%). This means 

30 that an abundance of uncleaved probe can be supplied to drive the hybridization step of a 
probe-based assay, yet the unconsumed (i.e., unreacted) probe can, in essence, be 
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subtracted from the result to reduce background by virtue of the fact that the unreacted 
probe will not migrate toward the same pole as the specific reaction product. 

Through the use of multiple positively charged adducts, synthetic molecules can 
be constructed with sufficient modification that the normally negatively charged strand is 
5 made nearly neutral. When so constructed, the presence or absence of a single phosphate 
group can mean the difference between a net negative or a net positive charge. This 
observation has particular utility when one objective is to discriminate between 
enzymaticaliy generated fragments of DNA, which generally lack a 3' phosphate, and the 
products of thermal degradation, which generally retain a 3' phosphate (and thus two 

10 additional negative charges, Fig. 2). Examples 1 and 2 demonstrate the ability to separate 
positively charged reaction products from a net negatively charged substrate 
oligonucleotide. As discussed in these examples, oligonucleotides may be transformed 
from net negative to net positively charged compounds. In Example 2, the positively 
charged dye, Cy3 was incorporated at the 5* end of a 22-mer (SEQ ID NO: 1) which also 

15 contained two amino-substituted residues at the 5' end of the oligonucleotide; this 

oligonucleotide probe carries a net negative charge. After cleavage, which occurred 2 
nucleotides into the probe, the following labeled oligonucleotide was released: 5'-Cy3- 
AminoT-AminoT-3' (in addition to unlabeled fragment comprising the remaining 20 
nucleotides of SEQ ID NO:l). This short fragment bears a net positive charge while the 

20 remainder of the cleaved oligonucleotide and the unreacted or input oligonucleotide bear 
net negative charges. 

The present invention contemplates embodiments wherein the specific reaction 
product produced by any cleavage of any oligonucleotide or molecule can be designed to 
carry a net positive charge while the unreacted molecule is charge neutral or carries a net 

25 negative charge. The present invention also contemplates embodiments where the 

released product may be designed to carry a net negative charge while the input nucleic 
acid carries a net positive charge. Depending on the length of the released product to be 
detected, positively charged dyes may be incorporated at the one end of the probe and 
modified bases may be placed along the oligonucleotide such that upon cleavage, the 

30 released fragment containing the positively charged dye carries a net positive charge. 
Amino-modified bases may be used to balance the charge of the released fragment in 
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cases where the presence of the positively charged adduct (e.g., dye) alone is not 
sufficient to impart a net positive charge on the released fragment. In addition, the 
phosphate backbone may be replaced with a phosphonate backbone at a level sufficient to 
impart a net positive charge (this is particularly useful when the sequence of the 
5 oligonucleotide is not amenable to the use of amino-substituted bases); Figs. 3 and 4 
show the structure of short oligonucleotides containing a phosphonate group on the 
second T residue). An oligonucleotide containing a fully phosphonate-substituted 
backbone would be charge neutral (absent the presence of modified charged residues 
bearing a charge or the presence of a charged adduct) due to the absence of the negatively 
1 0 charged phosphate groups. Phosphonate-containing nucleotides (e.g. , 

methylphosphonate-containing nucleotides) are readily available and can be incorporated 
at any position of an oligonucleotide during synthesis using techniques that are well 
known in the art. 

In essence, in these embodiments the invention contemplates the use of charge- 
15 based separation to permit the separation of specific reaction products from the input 
oligonucleotides in nucleic acid-based detection assays. The foundation of this novel 
separation technique is the design and use of oligonucleotide probes (typically termed 
"primers" in the case of PCR) that are "charge balanced" so that upon either cleavage or 
elongation of the probe it becomes "charge unbalanced," and the specific reaction 
20 products may be separated from the input reactants on the basis of the net charge. 

In some embodiments, in the context of assays that involve the elongation of an 
oligonucleotide probe (i.e., a primer), such as is the case in PCR, the input primers are 
designed to carry a net positive charge. Elongation of the short oligonucleotide primer 
during polymerization will generate PCR products that now carry a net negative charge. 
25 The specific reaction products may then easily be separated and concentrated away from 
the input primers using the charge-based separation technique described herein. 
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a. Applications in INVADER assay cleavage reactions 

i. Detection of Specific Nucleic Acid Sequences Using 5' 
Nucleases in an INVADER Directed Cleavage Assay 

5 The present invention finds application in the detection of cleavage products 

generated in the INVADER assay. The INVADER assay provides means for forming a 
nucleic acid cleavage structure that is dependent upon the presence of a target nucleic 
acid and cleaving the nucleic acid cleavage structure so as to release distinctive cleavage 
products. 5' nuclease activity, for example, is used to cleave the target-dependent 

10 cleavage structure and the resulting cleavage products are indicative of the presence of 
specific target nucleic acid sequences in the sample. When two strands of nucleic acid, 
or oligonucleotides, both hybridize to a target nucleic acid strand such that they form an 
overlapping invasive cleavage structure, as described below, invasive cleavage can occur. 
Through the interaction of a cleavage agent (e.g., a 5' nuclease) and the upstream 

1 5 oligonucleotide, the cleavage agent can be made to cleave the downstream 

oligonucleotide at an internal site in such a way that a distinctive fragment is produced. 
Such embodiments have been termed the INVADER assay (Third Wave Technologies) 
and are described in U.S. Patent Nos. 5,846,717, 5,985,557, 5,994,069, 6,001,567, and 
6,090,543 and PCT Publications WO 97/27214 and WO 98/42873, herein incorporated 

20 by reference in their entireties. 

The INVADER assay further provides assays in which the target nucleic acid is 
reused or recycled during multiple rounds of hybridization with oligonucleotide probes 
and cleavage of the probes without the need to use temperature cycling (i.e., for periodic 
denaturation of target nucleic acid strands) or nucleic acid synthesis (i.e., for the 

25 polymerization-based displacement of target or probe nucleic acid strands). When a 
• cleavage reaction is run under conditions in which the probes are continuously replaced 
on the target strand {e.g. through probe-probe displacement or through an equilibrium 
between probe/target association and disassociation, or through a combination 
comprising these mechanisms, (Reynaldo et aL, J. Mol. BioL 97:51 1 [2000])), multiple 

30 probes can hybridize in turn to the same target, allowing multiple cleavages, and the 
generation of multiple cleavage products. 
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By the extent of its complementarity to a target nucleic acid strand, an 
oligonucleotide may be said to define a specific region of said target. In an invasive 
cleavage structure, the two oligonucleotides define and hybridize to regions of the target 
that are adjacent to one another (i.e., regions without any additional region of the target 
5 between them). Either or both oligonucleotides may comprise additional portions that are 
not complementary to the target strand. In addition to hybridizing adjacently, in order to 
form an invasive cleavage structure, the 3* end of the upstream oligonucleotide must 
comprise an additional moiety. When both oligonucleotides are hybridized to a target 
strand to form a structure and such a 3' moiety is present on the upstream oligonucleotide 

10 within the structure, the oligonucleotides may be said to overlap, and the structure may be 
described as an overlapping, or invasive cleavage structure. 

In one embodiment, the 3* moiety of the invasive cleavage structure is a single 
nucleotide. In this embodiment the 3 1 moiety may be any nucleotide (i.e., it may be, but 
it need not be complementary to the target strand). In a preferred embodiment, the 3* 

15 moiety is a single nucleotide that is not complementary to the target strand. In another 
embodiment, the 3' moiety is a nucleotide-like compound (i.e., a moiety having chemical 
features similar to a nucleotide, such as a nucleotide analog or an organic ring compound; 
See e.g., U.S. Pat. No. 5,985,557). In yet another embodiment the 3* moiety is one or 
more nucleotides that duplicate in sequence one or more nucleotides present at the 5' end 

20 of the hybridized region of the downstream oligonucleotide. In a further embodiment, 

the duplicated sequence of nucleotides of the 3' moiety is followed by a single nucleotide 
that is not further duplicative of the downstream oligonucleotide sequence, and that may 
be any other nucleotide. In yet another embodiment, the duplicated sequence of 
nucleotides of the 3* moiety is followed by a nucleoti de-like compound, as described 

25 above. 

The downstream oligonucleotide may have additional moieties attached to either 
end of the region that hybridizes to the target nucleic acid strand. In a preferred 
embodiment, the additional moiety comprises a tag of the present invention. In a 
particularly preferred embodiment, the downstream oligonucleotide comprises a tag or 
30 other moiety at its 5' end (i.e., a 5' moiety). 
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When an overlapping cleavage structure is formed, it can be recognized and 
cleaved by a nuclease that is specific for this structure (i.e., a nuclease that will cleave 
one or more of the nucleic acids in the overlapping structure based on recognition of this 
structure, rather than on recognition of a nucleotide sequence of any of the nucleic acids 
5 forming the structure). Such a nuclease may be termed a "structure-specific nuclease." 
In some embodiments, the structure-specific nuclease is a 5* nuclease. In a preferred 
embodiment, the structure-specific nuclease is the 5' nuclease of a DNA polymerase. In 
another preferred embodiment, the DNA polymerase having the 5' nuclease is synthesis- 
deficient. In another preferred embodiment, the 5' nuclease is a FEN-1 endonucJease. In 

10 a particularly preferred embodiment, the 5' nuclease is thermostable. 

In some embodiments, the structure-specific nuclease preferentially cleaves the 
downstream oligonucleotide. In a preferred embodiment, the downstream 
oligonucleotide is cleaved one nucleotide into the 5' end of the region that is hybridized 
to the target within the overlapping structure. Cleavage of the overlapping structure at 

1 5 any location by a structure-specific nuclease produces one or more released portions or 
fragments of nucleic acid, termed "cleavage products." 

Detection of the cleavage products may be through release of a label Such labels 
may include, but are not limited to one or more of any of dyes, radiolabels such as 32 P or 
35 S, binding moieties such as biotin, mass tags, such as metal ions or chemical groups, 

20 charge tags, such as polyamines or charged dyes, haptens such as digoxgenin, 

luminogenic, phosphorescent or fluorogenic moieties, and fluorescent dyes, either alone 
or in combination with moieties that can suppress or shift emission spectra, such as by 
fluorescence resonance energy transfer (FRET) or collisional fluorescence energy 
transfer. 

25 Examples 1-3 and 9-18, below, demonstrate the use of charge balanced 

oligonucleotides in the INVADER assay. Cleavage results in the production of charge 
unbalanced products which are readily separated from the input molecules. The cleavage 
products are easily detected, providing an efficient and sensitive assay. 
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II. Positively Charged Moieties in the Synthesis of Charge-Balanced DNA Probes 

The present invention provides novel positively charged moieties that may be 
attached to any number of molecules, including nucleic acid molecules. These positively 
5 charged moieties find use in the charge reversal separation methods ("CRE" methods) of 
the present invention. As used herein, the term "positively charged moiety" refers to a 
chemical structure that possesses a net positive charge under the reaction conditions of its 
intended use (e.g., when attached to a molecule of interest under the pH of the desired 
reaction conditions). Positively charged moieties need not always carry a positive 

10 charge. Indeed, in some preferred embodiments of the present invention, the positively 
charged moiety does not carry a positive charge until it is introduced to the appropriate 
reaction conditions. This can also be thought of as "pH-dependent" and "pH- 
independent" positive charges. pH-dependent charges are those that possess the charge 
only under certain pH conditions, while pH-independent charges are those that possess a 

1 5 charge regardless of the pH conditions. 

The positively charged moieties, or "charge tags," when attached to another 
entity, can be represented by the formula: 

X-Y 

20 

where X is the entity (e.g., a solid support, a nucleic acid molecule, etc.) and Y is the 
charge tag. The charge tags can be attached to other entities through any suitable means 
(e.g., covalent bonds, ionic interactions, etc.) either directly or through an intermediate 
(e.g., through a linking group). In preferred embodiments, where X is a nucleic acid 
. 25 molecule, the charge tag is attached to either the 3' or 5' end of the nucleic acid molecule. 

The charge tags may contain a variety of components. For example, the charge 
tag Y can be represented by the formula: 

Y,-Y 2 

30 
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4 



where Y | comprises a chemical component that provides the positive charge to the charge 
tag and where Y2 is another desired component. Y2 may be, for example, a dye, another 
chemical component that provides a positive charge to the charge tag, a functional group 
for attachment of other molecules to the charge tag, a nucleotide, etc. Where such a 
5 structure is attached to another entity, X, either Yj or Y2 may be attached to X. 

X-Y r Y 2 orX-Y 2 -Yi. 

The charge tags are not limited to two components. Charge tags may comprise 
10 any number of desired components. For example, the charge tag can be represented by 
the formula: 

YrY2-Y3-Y n (n ~ any positive integer). 

15 where any of the Y groups comprises a chemical component that provides the positive 
charge to the charge tag and where the other Y groups are any other desired components. 
For example, in some embodiments, the present invention provides compositions of the 
structure: 

20 X-Y,-Y2-Y 3 -Y 4 

where X is an entity attached to the charge tag (e.g., a solid support, a nucleic acid 
molecule, etc.) and where Y| is a dye, Y 2 is a chemical component that provides the 
positive charge to the charge, Y3 is a component containing a functional group that 

25 allows the attachment of other molecules, and Y4 is a second chemical component that 
provides a positive charge. The identity of each of Y1-Y4 can be interchanged (i.e., the 
present invention is not limited by the order of the components). 

The present invention is not limited by the nature of the chemical components that 
provides the positive charge to the charge tag. Such chemical components include, but 

30 are not limited to, amines (primary, secondary, and tertiary amines), ammoniums, and 
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phosphoniums. The chemical components may also comprise chemical complexes that 
entrap or are otherwise associated with one or more positively charged metal ions. 

In preferred embodiments of the present invention, charge tags are attached to 
nucleic acid molecules (e.g., DNA molecules). The charge tags may be synthesized 
5 directly onto a nucleic molecule or may be synthesized, for example, on a solid support 
or in liquid phase and then attached to a nucleic acid molecule or any other desired 
molecule. In some preferred embodiments of the present invention, charge tags that are 
attached to nucleic acid molecules comprise one or more components synthesized by H- 
phosphonate chemistry (described in detail below), by incorporation of novel 
10 phosphoramidites (described in detail below), or a combination of both. For example, 
compositions of the present invention include structures such as: 

[XHY r Y 2 -Y 3 -Y 4 ] 

1 5 where [X] is a nucleic acid molecule and [Y . . .] is a charge tag. In some embodiments, 
Yj is a dye, Y 2 is synthesized using H-phosphonate chemistry and comprises a chemical 
component that provides a positive charge to the charge tag, Y3 is a positively charged 
phosphoramidite, and Y4 is a nucleotide or polynucleotide. Any of the Y components are 
interchangeable with one another. 

20 Such compositions find use in the charge-separation assays of the present 

invention. For example, a probe molecule in the INVADER assay may have a charge tag 
attached to its 5' end. The probe may comprise a net negative charge because of the 
plurality of negatively charge phosphate groups in the oligonucleotide backbone. 
Cleavage of the probe releases the charge tag from the rest of the probe. The released 

25 cleavage fragment, containing the charge tag, carries a net positive charge, while the 
remaining probe oligonucleotide carries a net negative charge. The cleaved fragments 
can then be readily separated from the uncleaved probes and detected, indicating the 
presence of a specific target sequence in the experimental sample. 
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a, H-Phosphonate chemistry. 

As discussed above, one or more components of a charge tag can be synthesized 
using H-phosphonate chemistry. Production of charge tags using the methods described 
herein provides a convenient and flexible modular approach for the design of a wide 
5 variety of charge tags. Since its introduction, solid phase H-phosphonate chemistry (B.C. 
Froehier, Methods in Molecular Biology, 20:33, S. Agrawal, Ed. Humana Press; Totowa, 
New Jersey[1993]) has been recognized as an efficient tool in the chemical synthesis of 
natural, modified and labeled oligonucleotides and DNA probes. Those skilled in the art 
know that this approach allows for the synthesis of the oligonucleotide fragments with a 

10 fully modified phosphodiester backbone (e.g., oligonucleotide phosphorothioates; 
Froechler [1993], supra) or the synthesis of oligonucleotide fragments in which only 
specific positions of the phosphodiester backbone are modified (Agrawal, et aL, 
Proc.Natl.Acad.Sci USA , 85:7079 [1988], Froehler,Tetrahedron Lett. 27:5575[1986], 
Froehier, et aL, Nucl. Acids Res. 16:4831 [1988]). The use of H-phosphonate chemistry 

1 5 allows for the introduction of different types of modifications into the oligonucleotide 
molecule (Agrawal, et aL, Froehler[1986], supra, Letsinger, et aL, J.Am. Chem.Soc, 
1 10:4470 [1988], Agrawal and Zamecnik, Nucl. Acid Res. 18:5419 [1990], Handong, et 
aL, Bioconjugate Chem. 8:49 [1997], Vinogradov, et aL, Bioconjugate Cheni. 7:3 [1995], 
Schultz, ei aL, Tetrachedron Lett. 36:8407 [1995]), however the replacement of the 

20 phosphodiester linkage by the phosphoramidate linkage is one of the most frequent 
changes due to its effectiveness and synthetic flexibility. Froehier and Letsinger were 
among first to use this approach in the synthesis of modified oligonucleotides in which 
phosphodiester linkages were fully or partially replaced by the phosphoramidate 
linkages bearing positively charged groups (e.g., tertiary amino groups; Froehier [1986], 

25 Froehier, et aL, [1 988], and Letsinger, et aL , supra). 

In some embodiments of the present invention, charge tags are generated using H- 
phosphonate chemistry. The charge tags may be assembled on the end of a nucleic acid 
molecule or may be synthesized separately and attached to a nucleic acid molecule. Any 
suitable phosphorylating agent may be used in the synthesis of the charge tag. For 

30 example, the component to be added may contain the structure: 
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A-B-P 

where A is a protecting group, B is any desired functional group (e.g., a functional group 
that provides a positive charge to the charge tag), and P is a chemical group containing 
5 phosphorous. In preferred embodiments, B comprises a chemical group that is capable of 
providing a positive charge to the charge tag. However, in some embodiments B is a 
functional group that allows post-synthetic attachment of a positively charged group to 
the charge tag. 

The process of the synthesis of the charge-balanced charge tag containing (CRE) 
1 0 probes using H-phosphonate chemistry can be divided into steps. 

1 . In the first step, the specific DNA sequence is synthesized using a 
standard automated phosphoramidite protocol (a reporter molecule (dye) may be 
introduced into the molecule at this stage using phosphoramidite or H-phosphonate 
chemistry, or it can be attached to the probe after the completion of other steps of the 

15 modification procedure using any of the standard post-synthetic labeling protocols). 

2. In the second step, a modification procedure is performed using solid- 
phase H-phosphonate chemistry. The DNA probe, suspended on the solid support, is 
coupled to an appropriate H-phosphonate monomer in the presence of an appropriate 
activating reagent (e.g., pivaloyl chloride). This step leads to the formation of the 

20 reactive H-phosphonate intermediate (Fig. 5). 

Group "Z" in Fig. 5 represents any organic group (with any other functional 
groups present protected as necessary for protocols of chemical synthesis of 
oligonucleotides). Group "Z" may optionally contain other DMT-protected hydroxyl 
groups (or other appropriately protected functional groups), to which additional 

25 monomeric units (e.g., H-phosphonate, phosphoramidite, etc.) can be attached, either 

covalently or noncovalently (e.g. thorough complex formation). Wavy lines in Fig. 5 and 
other figures in this patent disclosure, e.g., as shown linking controlled pore glass (CPG) 
and the DNA molecule (and which may link any two entities of these compositions), 
represent any kind of atom or organic group that can serve these purposes. 
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This step should be performed on a DNA synthesizer with H-phosphonate 
adaptation or should be performed manually according to a solid phase H-phosphonate 
coupling protocol. 

A subsequent step of the modification procedure involves the conversion of the 
5 intermediate H-phosphonate into the phosphoramidate-bearing group(s) that can 

introduce positive charges into the composition. Usually, this conversion is performed 
with the help of an Atherton-Tod reaction, in which the intermediate H-phosphonate III 
or IV is treated with a solution of an appropriate primary or secondary amine, carbon 
tetrachloride (or other reagent(s) leading to the same type of transformation in which 

1 0 phosphoramidate bond between the amine used in the reaction and phosphorus atom is 
formed) in anhydrous aprotic solvent(s), preferably pyridine, mixture of pyridine and 
acetonitrile, or pyridine and tetrahydrofuran. Fig. 6 shows the conversion leading to the 
synthesis of V and VI. 

The structure of the monomer ic H-phosphonate may optionally contain additional, 

1 5 appropriately protected functional groups (e.g., amino, hydroxy I, mercapto or carboxy 
groups) that can be used in other steps of the synthesis and modification of the probe 
containing the charge tag. 

If the modification procedure involves multiple coupling steps performed using 
H-phosphonate chemistry or phosphoramidite chemistry, the H-phosphonate monomer(s) 

20 used in the modification procedure should contain selectively protected hydroxy! group, 
preferably with the DMT protecting group, while other functional groups should be 
protected with protecting groups compatible with the protocol of the chemical synthesis 
of oligonucleotides. 

It is important to note that the possibility of the use of the intermediate materials I 
25 or II significantly increases the synthetic flexibility of the modification procedure (and 
helps to create a broad variety of charge-balanced probes). By altering the sequence of 
coupling of the H-phosphonate reagents and another reagents (e.g. reporter molecules) to 
the synthesized DNA sequence, different probes (CRE-V1) can be synthesized. The 
probes generate fragments of varying polarity and/or mobility upon cleavage in, for 
30 example, an INVADER assay. The synthetic flexibility of the H-phosphonate approach 
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can be conveniently illustrated on the example of the synthesis of the multiple labeled 
CRE probe. 

Introduction of multiple points of modification with moieties bearing positive 
charge(s) may be desired, in order to compensate negative charges introduced into the 
5 probe by another group (e.g., a dye bearing multiple negative charges or other groups). 

The synthesis of CRE probes containing only two points of modification, one 
introducing a positively charged moiety and one introducing a neutral group for structure 
modulation, and having only one dye that does not alter the net charge (e.g., Cy3 dye 
introduced using phosphoramidite chemistry), is illustrated in Fig. 8. 
10 As it can be seen, the synthetic procedure in which only one reporter group, one 

type of H-phosphonate monomer and two different amines were used, can generate six 
different charge-balanced CRE probes. The number of possible structural variations of 
the synthesized charge-balanced CRE probes using a single reporter molecule (e.g. Cy3) 
can be significantly expanded if the synthesis is performed using one of two structurally 
1 5 different H-phosphonate monomers, one of two different amines for introducing positive 
charge, and one of two d ifferent amines for structure modulation. The use of those 
reagents will lead to the creation of four different modifications introducing positive 
charge and four different structure modulating modifications. 

In the discussed example, the structure of a charge-balanced CRE probe should 
20 contain one position occupied by a reporter molecule (e.g. Cy3), one position occupied 

by a modification introducing positive charge and (optionally) one position occupied by a 
structure- modulating modification. A total 96 different charge-balanced CRE probes can 
be synthesized using the above mentioned reagents. 

It is clear that a large number of possible structural permutations are achieved 
. 25 with the use of only seven different reagents, allowing for the selection of the structural 
arrangement that will offer a particular desired probe performance (i.e., assay 
performance and/or the desired electrophoretic mobility of the cleaved positively charged 
fragments). The same set of reagents can be used in the synthesis of charge balanced 
probes that do not contain any neutral modifications (e.g., as used for structure 
30 modulation) or that contain multiple points at which structure-modulating modification 
can be added. This further expands the number of possible structures of charge-balanced 
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probes that can be synthesized using a relatively small (seven in the discussed example) 
number of reagents. It is important to note that reporter groups can be also introduced 
into CRE probes using H-phosphonate chemistry. Fig. 9 diagrams a process in which an 
activated H-phosphonate of a reporter molecule (e.g., a dye) reacts with an available 
5 hydroxyl group of an oligonucleotide attached to a solid phase, leading to the formation 
of an intermediate H-phosphonate IVa, which is subsequently converted to a 
phosphoramidate-derivative using an appropriate primary or secondary amine and the 
chemical reaction described above. 

In all cases, these procedures lead to the attachment of a specific structure of 

10 charged organic moiety (described later as COM <+) ) to a DNA sequence. As a result, a 
positively charged fragment (positively charged Tag; called later "PCT") cleaved in the 
enzymatic process, will be composed of one nucleotide and the COM (+) , and will have the 
desired net positive charge (Fig. 10). 

As an example illustrating the use of H-phosphonate chemistry in the synthesis of 

15 the CRE probes, the synthesis of five different charge-balanced CRE probes was 

performed (Fig. 1 1). All synthesized charge-balance probes were tested in an INVADER 
assay. It was found that the cleaved PCTs have different electrophoretic mobility under 
the conditions of reverse capillary electrophoresis. 

The use of H-phosphonates in the modification of CRE probes is associated with the 
20 generation of a new center of chirality at the tetracoordinated phosphoramidate 

phosphorus atom (Fig. 12). The use of chiral (optically active) and more sterically bulky 
H-phosphonate monomers (e.g. dT, dA, dC, dG H-phosphonates) can lead to the 
formation of diastereoisomers, which will have different chromatographic and 
electrophoretic properties. When relatively small and achiral H-phosphonate monomer 
25 was used (e.g., DMT-protected H-phosphonate of 1,6-hexanediol), the formation of the 
stereoisomers was not detectable under either reverse phase HPLC and capillary 
electrophoresis conditions. However, diastereoisomeric forms of the larger synthesized 
materials can be detected as separate peaks in the analytical RP HPLC profiles, and in the 
CE profiles of both the intact CRE probes and the positively charged products of 
30 enzymatic cleavage. The separations between diastereoisomers under those conditions 
can vary and can depend on the nature of the groups introduced in the modification step. 
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Introduction of multiple points (n) of modification using H-phosphonate reagents leads to 
the formation of 2 n diastereoisomers, which may or may not be separated under the 
conditions used for the probe purification, analysis or under the conditions of the CRE 
experiments. The separation of the diastereoisomers can be disadvantageous in situations 
5 where probes will be used in a multiplex assay. Formation of the diastereoisomeric 
forms of the charge balanced CRE probes was observed in all cases in which H- 
phosphonates of the 5'-DMT protected deoxynucleosides were used. 

In one case, (dA H-phosphonate, amine used in the conversion of the intermediate 
H-phosphonate into the phosphoramidate: H 2 NCH 2 CH2NMe2) the separation of the 

10 diastereoisomers under reverse phase HPLC conditions (Ol 8 column) allowed 
separation of the isomers. Analysis of the isolated fractions by mass spectrometry 
revealed that the materials had identical molecular weight, corresponding to that of the 
desired product. Therefore, if a step of purifying the individual diastereoisomers is not 
intended, or when complete separation is not possible, the use of achiral H-phosphonates 

15 as a building block in the synthesis of the CRE probes for such system may be preferred 
to the use of chiral H-phosphonates. However, in cases when the separation of the 
diastereoisomers in pure form is possible (e.g., by reverse phase HPLC), the individual 
diastereoisomers can be used as separable tags in CRE assays, further expanding the 
diverse library of the H-phosphonate-generated CRE probes. 

20 In some embodiments of the present invention, an H-phosphonate of Cy3 is used 

to directly introduce a charge-bearing unit into a charge tag. For example, use of an H- 
phosphonate of Cy3 can provide a charge tag containing the structure: 



DNA P O Cy3 OH 

NR 
Ri 

25 where any desired amine can be readily incorporated into the position NR. This allows, 
for example, the production of a palette of different charge tags that will provide different 
mobility in separation assays. 
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b. A New Class of Phosphoramidite Building Blocks: ,f Positively Charged 
Phosphoramidites" (PCP) and "Neutral Phosphoramidites" (NP). 
Positively charged phosphoramidites (PCP) and neutral phosphoramidites (NP) 

represent a new class of phosphoramidite building blocks designed to introduce both 
5 positive charge and structure modulation into the synthesized charge-balanced CRE 

probe. 

A standard coupling protocol using phosphoramidite reagents is associated with 
the introduction into the growing molecule, of one negative charge per coupling step, due 
to the formation of the phosphodiester linkage (Fig. 13). In the synthesis of charge- 

1 0 balanced CRE probes in which a specific ratio of negative and positive charges should be 
maintained, the introduction of additional negative charges can represent a disadvantage. 
To eliminate this disadvantage, new types of phosphoramidites were designed to either 
introduce a net positive charge(s) at each coupling step (positively charged 
phosphoramidites, PCPs), or to introduce no extra charge (neutral phosphoramidites, 

15 NPs) into the synthesized CRE probe. Fig. 14 shows general structures of the PCP and 
NP phosphoramidites in some embodiments of the present invention. 

The positively charged group (Y*) represents any organic group that can exist in a 
positively charged form, preferably primary, secondary or tertiary amines. Modification 
with the introduction of quarternary ammonium groups or other organic positively 

20 charged groups is also contemplated. 

Both PCPs and NPs can be used in combination with other phosphoramidite 
building blocks (PBBs), which introduce one negative charge per coupling, but which can 
serve as structure modulating factors. Diversification of the structures of the PCPs and 
NPs can also serve as factors for the structure modulation of the synthesized CRE probe. 

25 This approach allows for the synthesis of a large variety of the charge-balanced CRE 

probes using a standard phosphoramidite coupling protocol for oligonucleotide synthesis. 

For example, Fig. 15 illustrates possible combinations in the synthesis of the 
charge-balanced CRE probe when the synthesis is performed with the use of one dye 
phosphoramidite (DP), which introduces zero net charge (e.g., Cy3 phosphoramidite), 

30 PBB, which introduces one negative charge, one NP introducing zero net charge, and one 
PCP, which introduces one net positive charge. As shown in Fig. 15, due to the large 
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number of positional permutations possible in the design of the probe structure, a large 
variety of charge-balanced structures can be synthesized using only four reagents. 

While Fig. 15 illustrates the synthesis of the charge-balanced CRE probes in which 
the reporter molecule (Cy3) is attached directly to the oligonucleotide sequence, other 
5 structural permutations in which the reporter molecule can occupy other positions are 
also contemplated. 

Therefore, this approach creates a unique opportunity to synthesize a large 
number of the charge-balanced CRE probes using only one reporter molecule. For 
example, Fig, 13 presents an embodiment in which a dye that does not introduce any net 

10 charge (e.g., Cy3 phosphoramidite) was used in probe synthesis. This does not preclude 
the use other dyes in the synthesis of a different set of charge-balanced CRE probes for 
use, e.g., in multiplex detection systems using, for example, the INVADER Assay. It is 
also worth noting that, in contrast to the H-phosphonate approach, the use of the new type 
of phosphoramidites does not lead to the creation of new centers of chirality. 

15 In an additional embodiment, the H-phosphonates and the phosphoramidites of 

the present invention are used in combination, e.g., in the synthesis of the specifically 
modified charge-balanced CRE probes. Fig. 16 shows an example of the synthesized 
neutral phosphoramidite and positively charged phosphoramidite, and Fig. 17 shows the 
structures of a set of charge-balanced CRE probes that were synthesized utilizing PCPs 

20 and NPs. 

Commercially available phosphoramidite of the 18-atom linker (polyethylene 
glycol derivative; Glen Research; Cat.# 10-1918-90) was used as a building block 
phosphoramidite used for structure modulation, (indicated in Fig. 17 as "18AL"). 

Linkers of different lengths and of different chemical natures can be used as 
. 25 structure modulating reagents. 
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10 



The present invention also provides new synthetic methods using phosphoramites 
to generate charge tags containing a unit with a charge group and a phosphate group. For 
example, as described above, H-phosphonate chemistry can be used to add a charged unit 
onto a nucleic acid structure: 

O 



DNA OH DNA O— ^ 

R R 

(where X is one or more additional components of the charge tag and the R groups are 
any other desired chemical groups). The same structure may be generated using 
phosphoramidite addition by first adding the phosphoramite, then using a Michaelis- 
Arbuzov reaction in the presence of, for example, an amine: 



?R 



o 

DNA OH DNA P — ODMT +~ DNA O P- 

^ i 



° CE ^P ODMT 

OR h (or Br 2 ) 



15 The above methods of generating charge tags allow an extremely wide variety of 

charge tags to be made. This variety of options allows for multiplex detection methods. 
For example, in the context of the INVADER assay, a charge tag attached to a probe 
oligonucleotide could have three components: 

20 34probe]-54YrY 2 -Y 3 ] 

where Yi is one of any number of dyes, Y 2 is one of any number of groups containing a 
positive charges, and Y 3 is one of four nucleotides (e.g., not complementary to the target 
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nucleic acid). If four different dyes and four different charged groups are used, this 
would introduce 4x4x4, or 64 distinct charge tags that could be individually resolvable 
using the methods described herein (e.g., rnicrofluidics). By adding additional 
components or additional choices at each component, hundred to thousands or more 
5 distinct charge tags can be made and used in multiplex analyses. 

EXAMPLES 

The following examples serve to illustrate certain preferred embodiments and 
aspects of the present invention and are not to be construed as limiting the scope thereof. 

10 In the disclosure which follows, the following abbreviations apply: Afu 

(Archaeoglobus fulgidus); Mth (Methanobacterium thermoautotrophicum); Mja 
(Methanococcus jannaschii); Pfu (Pyrococcus furibsus); Pwo (Pyrococcus woesei); Taq 
(Thermus aquaticus); Taq DNAP, DNAPTaq, and Taq Pol I (T. aquaticus DNA 
polymerase I); DNAPStf (the Stpffel fragment of DNAPTaq); DNAPEcl (E. coli DNA 

15 polymerase I); Tth (Thermus thermophilus); Ex. (Example); Fig. (Figure);°C (degrees 
Centigrade); g (gravitational field); hr (hour); min (minute); olio (oligonucleotide); rxn 
(reaction); vol (volume); w/v (weight to volume); v/v (volume to volume); BSA (bovine 
serum albumin); CTAB (cetyltrimethylammonium bromide); HPLC (high pressure liquid 
chromatography); DNA (deoxyribonucleic acid); p (plasmid); uJ (microliters); mi 

20 (milliliters); u.g (micrograms); mg (milligrams); M (molar); mM (milHMolar); \xM 
(microMolar); pmoles (picomoles); amoles (attomoles); zmoles (zeptomoles); 
nm (nanometers); kdal (kilodaltons); OD (optical density); EDTA (ethylene diamine 
tetra-acetic acid); FITC (fluorescein isothiocyanate); SDS (sodium dodecyl sulfate); 
NaP0 4 (sodium phosphate); NP-40 (Nonidet P-40); Tris (tris(hydroxymethyl)- 

25 aminomethane); PMSF (phenylmethylsulfonylfluoride); TBE (Trfs-Borate-EDTA, i.e., 
Tris buffer titrated with boric acid rather than HC1 and containing EDTA); 
PBS (phosphate buffered saline); PBS (phosphate buffered saline containing 1 mM 
PMSF); PAGE (polyacrylamide gel electrophoresis); Tween (polyoxyelhylene-sorbitan); 
ATCC (American Type Culture Collection, Rockville, MD); Coriell (Coriell Cell 

30 Repositories, Camden, NJ); DSMZ (Deutsche Sammlung von Mikroorganismen und 
Zellculturen, Braunschweig, Germany); Sigma (Sigma Chemical Company, St. Louis, 
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MO); MJ Research (MJ Research, Watertown,MA); Novagen (Novagen, Inc., Madison, 
WI); Perkin Elmer (Perkin Elmer Instruments, Norwalk, CT); Promega (Promega Corp. 
Madison, Wl); Clonetech (Clonetech, Palo Alto, CA); Pharmacia (Pharmacia, 
Piscataway, NJ); Hitachi (Hitachi Instruments Inc. San Jose, CA Qiagen (Qiagen, Inc. 
5 Valencia, CA); Biol 01 (Bio 101 Inc. Vista, CA); Aidrich (Aidrich Chemical Company 
Inc. Milwaukee, WI); VWR (VWR Scientific Products, West Chester, PA); Glen 
Research (Glen Research Corporation, Sterling VA); PE Biosystems (PE/ Applied 
Biosystems, Foster City, CA); Wheaton (Wheaton Science Products, Millville, NJ); EM 
Science' (EM Science, Gibbstown NJ); Gelman (Gelman Science, Ann Arbor, MI); 

10 Becton Dickensen (Becton Dickensen Labware, Bedford, MA); Buchi (Biichi Analytical, 
Switzerland); Chemglass (Chemglass Inc. Vineland, NJ); Dot Scientific (Dot Scientific 
Inc. Burton, MI); Eppendorf Scientific (Eppendorf Scientific Inc. Westbury, NY); 
Applied Biosystems (Applied Biosystems, Foster City, CA); lnvitrogen (Invitrogen 
Corporation, Carlsbad, CA); Ambion (Ambion Inc. Austin, TX); Gibco BRL (Life 

15 Technologies, Gaithersburg, MD); USB (US Biochemical, Cleveland, OH); Calbiochcm 
(Calbiochern, San Diego, CA). 

EXAMPLE 1 
Detection Of DNA By Charge Reversal 

20 The detection of specific targets is achieved in the INVADER-directed cleavage 

assay by the cleavage of a probe oligonucleotide. The cleaved probe may be separated 
from the uncleaved probe using the charge reversal technique described below. This 
novel separation technique is related to the observation that positively charged adducts 
can affect the electro phoretic behavior of small oligonucleotides because the charge of 
. 25 the adduct is significant relative to charge of the whole complex. Observations of 

aberrant mobility due to charged adducts have been reported in the literature, but in all 
cases found, the applications pursued by other scientists have involved making 
oligonucleotides larger by enzymatic extension. As the negatively charged nucleotides 
are added on, the positive influence of the adduct is reduced to insignificance. As a 
30 result, the effects of positively charged adducts have been dismissed and have received 
little notice in the existing literature. 
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Through the use of multiple positively charged adducts, synthetic molecules can 
be constructed with sufficient modification that the normally negatively charged strand is 
made nearly neutral. When so constructed, the presence or absence of a single phosphate 
5 group can mean the difference between a net negative or a net positive charge. This 
observation has particular utility when one objective is to discriminate between 
enzymatically generated fragments of DNA, which generally lack a 3* phosphate, and the 
products of thermal degradation, which generally retain a 3* phosphate (and thus two 
additional negative charges). 

10 

a) Characterization Of The Products Of Thermal Breakage Of DNA 
Oligonucleotides 

Thermal degradation of DNA probes results in high background that can obscure 
signals generated by specific enzymatic cleavage, decreasing the signai-to-notse ratio. 

1 5 To better understand the nature of DNA thermal degradation products, the 5' tetrachloro- 
fluorescein (TET)-Iabeied oligonucleotides 78 (SEQ ID NO:3) and 79 (SEQ ID NO:4) 
(100 pmole each) were incubated in 50 p.1 10 mM NaC0 3 (pH 10.6), 50 mM NaCI at 
90°C for 4 hours. To prevent evaporation of the samples, the reaction mixture was 
overlaid with 50 pi of CHILLOUT liquid wax (MJ Research). The reactions were then 

20 divided in two equal aliquots (A and B). Aliquot A was mixed with 25 fxl of methyl 
violet loading buffer and Aliquot B was dephosphorylated by addition of 2.5 pi of 100 
mM MgCl 2 and 1 pi of 1 unit/pi Calf Intestinal Alkaline Phosphatase (CIAP) (Promega), 
with incubation at 37°C for 30 min. after which 25 p.1 of methyl violet loading buffer was 
added. One microliter of each sample was resolved by electrophoresis through a 12% 

25 po 1 y aery 1 amide denaturing gel and imaged as described in Example 21; a 585 nm filter 
was used with the FMBIO Image Analyzer. The resulting imager scan is shown in Fig. 2. 

In Fig. 2, lanes 1-3 contain the TET-labeled oligonucleotide 78 and lanes 4-6 
contain the TET-labeled oligonucleotide 79. Lanes 1 and 4 contain products of reactions 
that were not heat treated. Lanes 2 and 5 contain products from reactions that were heat 

30 treated and lanes 3 and 6 contain products from reactions that were heat treated, then 
subjected to phosphatase treatment. 
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As shown in Fig. 2, heat treatment causes significant breakdown of the 5'-TET- 
labeled DNA, generating a ladder of degradation products (Fig. 2, lanes 2, 3, 5 and 6). 
Band intensities correlate with purine and pyrimidine base positioning in the 
oligonucleotide sequences, indicating that backbone hydrolysis may occur through 
5 formation of abasic intermediate products that have faster rates for purines than for 
pyrimidines (Lindahl and KarlstrSm, Biochem., 12:5151 [1973]). 

Dephosphorylation decreases the mobility of all products generated by the 
thermal degradation process, with the most pronounced effect observed for the shorter 
products (Fig. 2, lanes 3 and 6). This demonstrates that thermally degraded products 

10 possess a 3' end terminal phosphoryl group that can be removed by dephosphorylation 

with C1AP. Removal of the phosphoryl group decreases the overall negative charge by 2. 
Therefore, shorter products that have a small number of negative charges are influenced 
to a greater degree upon the removal of two charges. This leads to a larger mobility shift 
in the shorter products than that observed for the larger species. 

15 The products generated by the CLEAVASE enzyme do not contain this additional 

3' phosphate. Therefore, if an assay is designed such that the desired reaction products 
contain one or two positive charges, similar thermal breakdown products would be 
neutral or negative. This allows for easy separation of product from background via the 
reverse charge methods described below. 

20 

b) Dephosphorylation Of Short Ammo-Modified Oligonucleotides Can 
Reverse The Net Charge Of The Labeled Product 

To demonstrate how oligonucleotides can be transformed from net negative to net 
positively charged compounds, the four short amino-modified oligonucleotides labeled 

25 70, 74, 75 and 76 and shown in Figs. 3-4 were synthesized. All four modified 

oligonucleotides possess Cy3 dyes positioned at the 5'-end, which individually are 
positively charged under reaction and isolation conditions described in this Example. 
Compounds 70 and 74 contain two amino modified thymidines that, under reaction 
conditions, display positively charged R-NH3 4 * groups attached at the C5 position through 

30 a C10 or €5 linker, respectively. Because compounds 70 and 74 are 3'-end 

phosphorylated, they consist of four negative charges and three positive charges. 
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Compound 75 differs from 74 in that the internal C 6 amino modified thymidine phosphate 
in 74 is replaced by a thymidine methyl phosphonate. The phosphonate backbone is 
uncharged and so there are a total of three negative charges on compound 75. This gives 
compound 75 a net negative one charge. Compound 76 differs from 70 in that the 
5 internal amino modified thymidine is replaced by an internal cytosine phosphonate. The 
pK a of theN3 nitrogen of cytosine can be from 4 to 7. Thus, the net charges of this 
compound, can be from -1 to 0 depending on the pH of the solution. For the simplicity of 
analysis, each group is assigned a whole number of charges, although it is realized that, 
depending on the pK u of each chemical group and ambient pH, a real charge may differ 

10 from the whole number assigned. It is assumed that this difference is not significant over 
the range of pHs used in the enzymatic reactions studied here. 

Dephosphorylation of these compounds, or the removal of the 3' end terminal 
phosphoryl group, results in elimination of two negative charges and generates products 
that have a net positive charge of one. In this experiment, the method of isoelectric 

15 focusing (IEF) was used to demonstrate a change from one negative to one positive net 
charge for the described substrates during dephosphorylation. 

Substrates 70, 74, 75 and 76 were synthesized by standard phosphoramidite 
chemistries and deprotected for 24 hours at 22°C in 14 M aqueous ammonium hydroxide 
solution, after which the solvent was removed in vacuo. The dried powders were 

20 resuspended in 200 u.1 of IhO and filtered through 0.2 u.m filters. The concentration of 
the stock solutions was estimated by UV-absorbance at 261 nm of samples diluted 200- 
fold in H2O using a spectrophotometer (Spectronic Genesys 2, Milton Roy, Rochester, 
NY), 

Dephosphorylation of compounds 70 and 74, 75 and 76 was accomplished by 
25 treating 10 \x\ of the crude stock solutions (ranging in concentration from approximately 

0.5 to 2 mM) with 2 units of CIAP in 100 \x\ of CIAP buffer (Promega) at 37°C for 1 

hour. The reactions were then heated to 75°C for 15 min. in order to inactivate the CIAP. 

For clarity, dephosphorylaled compounds are designated 'dp*. For example, after 

dephosphorylation, substrate 70 becomes 70dp. 
30 To prepare samples for IEF experiments, the concentration of the stock solutions 

of substrate and dephosphorylaled product were adjusted to a uniform absorbance of 8.5 
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x 10" 3 at 532 nm by dilution with water. Two microliters of each sample were analyzed 
by 1EF using a PhastSystem electrophoresis unit (Pharmacia) and PhastGe] IEF 3-9 
media (Pharmacia) according to the manufacturer's protocol. Separation was performed 
at 15°C with the following program: pre-run; 2,000 V, 2.5 mA, 3.5 W, 75 Vh; load; 200 
5 V, 2.5 mA, 3.5 W, 15 Vh; run; 2,000 V; 2.5 mA; 3.5 W, 130 Vh. After separation, 

samples were visualized by using the FMBIO Image Analyzer (Hitachi) fitted with a 585 
nm filter. The resulting imager scan is shown in Fig. 1 8. 

Fig. 18 shows results of IEF separation of substrates 70, 74, 75 and 76 and their 
dephosphorylated products. The arrow labeled "Sample Loading Position" indicates a 

10 loading line, the sign shows the position of the positive electrode and the sign 
indicates the position of the negative electrode. 

The results shown in Fig. 1 8 demonstrate that substrates 70, 74, 75 and 76 
migrated toward the positive electrode, while the dephosphorylated products 70dp, 74dp, 
75dp and 76dp migrated toward negative electrode. The observed difference in mobility 

15 direction was in accord with predicted net charge of the substrates (minus one) and the 
products (plus one). Small perturbations in the mobilities of the phosphorylated 
compounds indicate that the overall pis vary. This was also true for the 
dephosphorylated compounds. The presence of the cytosine in 76dp, for instance, moved 
this compound further toward the negative electrode, which was indicative of a higher 

20 overall pi relative to the other dephosphorylated compounds. It is important to note that 
additional positive charges can be obtained by using a combination of natural amino 
modified bases (70dp and 74dp) along with uncharged methylphosphonate bridges 
(products 75dp and 76dp). 

The results shown above demonstrate that the removal of a single phosphate 

25 group can flip the net charge of an oligonucleotide to cause reversal in an electric field, 
allowing easy separation of products, and that the precise base composition of the 
oligonucleotides affect absolute mobility but not the charge-flipping effect. 
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EXAMPLE 2 

Detection Of Specific Cleavage Products In The INVADER-Directed Cleavage 
Reaction By Charge Reversal 
In this Example the ability to isolate products generated in the INVADER- 
5 directed cleavage assay from all other nucleic acids present in the reaction cocktail using 
charge reversal is demonstrated. 
Enzymes for Cleavage Assays 

The CLEAVASE A/G enzyme was prepared as described in U.S. Patent 
6,090,606, and PCT application WO 98/23774 (herein incorporated by reference in their 

10 entireties); Afu FEN 1 and Pfii FEN1 were isolated as described in WO 98/23774. Two 
other enzymes used in these studies, CLEAVASE TthAKK enzyme and Ave FEN1 
nuclease, were produced as described in the following sections. 
Cloning and Expression of Cleavase TthAKK 
Initial TthPoi Isolation 

1 5 Genomic DNA was prepared from 1 vial of dried Thermus thermophilus strain 

HB-8 from ATCC (ATCC #27634). The DNA polymerase gene was amplified by PCR 
using the following primers: 5*-CACGAATTCCGAGGCGATGCTTCCGCTC-3' (SEQ 
ID NO:5) and 5-TCGACGTCGACTAACCCTTGGCGGAAAGCC-3' (SEQ ID NO:6). 
The resulting PCR product was digested with EcoKL and Sail restriction endonucleases 

20 and inserted into EcoRUSal I digested plasmid vector pTrc99G . The pTrc99G vector 
was created by modification of the pTrc99A vector (Pharmacia) to remove the G at 
position 270 of the pTrc99A map. To this end, pTrc99A plasmid DNA was cut with 
Ncol and the recessive 3' ends were filled-in using the Klenow fragment of E.coli 
polymerase I in the presence of all four dNTPs at 37°C for 1 5 min. After inactivation of 

25 the Klenow fragment by incubation at 65°C for 1 0 min, the plasmid DNA was cut with 
EcoRl and the ends were again filled-in using the Klenow fragment in the presence of all 
four dNTPs at 37°C for 15 min. The Klenow fragment was then inactivated by 
incubation at 65°C for 1 0 min. The plasmid DNA was ethanol precipitated, 
recircularized by ligation, and used to transform E.coli JM109 cells (Promega). The 

30 pTrc99G plasmid DNA was isolated from single colonies, and deletion of the G at 
position 270 (by reference to the pTrc99A map) was confirmed by DNA sequencing. 
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Insenion of the Tth DNA into this vector as described above created the plasmid 
pTrcTth-1 . This Tth polymerase construct is missing a single nucleotide that was 
inadvertently omitted from the 5' oligonucleotide, resulting in the polymerase gene being 
out of frame. This mistake was corrected by site specific mutagenesis of pTrcTth-1 using 
5 the TRANSFORMER Site Directed Mutagenesis Kit (Clontech) according to the 
manufacturer's instructions, and the following oligonucleotide: 
5-GCATCGCCTCGGAATTCATGGTC-3' (SEQ ID NO:7), to create the plasmid 
pTrcTth-2. The protein and the nucleic acid sequence encoding the protein are referred 
to as TthPol, and are listed as SEQ ID NOS:8 and 9 respectively. 

10 

Modified TthPol Gene: Tth DN 

The Tth DN construct was created by mutating the TthPol-2 described above. 
The sequence encoding an aspartic acid at position 787 was changed by site-specific 
mutagenesis as described above to a sequence encoding asparagine. Mutagenesis of 
1 5 pTrcTth-2 with the following oligonucleotide: 

5'-C AGGAGGAGCTCGTTGTGGACCTGGA-3' (SEQ ID NO: 1 0) was performed to 
create the plasmid pTrcTthDN. The mutant protein, termed Tth DN, and protein coding 
nucleic acid sequence are SEQ ID NOS:l 1 and 12, respectively. 

20 Tth DN HT 

A six-amino acid histidine tag (his-tags) was added onto the carboxy terminus of 
Tth DN. The site-directed mutagenesis was performed using the TRANSFORMER Site 
Directed Mutagenesis Kit (Clontech) according to the manufacturer's instructions. The 
mutagenic oligonucleotides used on the plasmid pTth DN was sequence 
. 25 S'-TGCCTGCAGGTCGACGCTAGCTAGTGGTGGTGGTGGTGGTGACCCTTGGCG 
GAAAGCC-3' (SEQ ID NO: 13), sequence 136-037-05. The selection primer Trans 
Oligo AlwNI/Spel (Clontech, catalog # 6488-1) was used for both mutagenesis reactions. 
The resulting mutant gene was termed Tth DN HT (SEQ ID NO: 14, nucleic acid 
sequence; SEQ ID NO: 15, amino acid sequence). 

30 
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plasmid was over-expressed and purified using Qiapen QiaPrep Spin Mini Prep Kit (cat. 
# 27106). The vector was tested for the presence of the restriction sites by DNA 
sequencing and restriction mapping. The construct is termed Tth DN RX HT (DNA 
sequence SEQ ID NO: 19; amino acid sequence SEQ ID NO:20) 

5 

Addition of point mutations 

Plasmid DNA was purified from 200 ml of JM109 overnight culture using 
Q1AGEN Plasmid Maxi Kit (QIAGEN) according to the manufacturer's protocol to 
obtain enough starting material for all mutagenesis reactions. All site-specific mutations 

10 were introduced using the Transformer Site Directed mutagenesis Kit (Clontech) 

according to the manufacturer's protocol. One of two different selection primers, Trans 
Oligo AlwNI/Spel or Switch Oligo Spel/AlwNI (Clontech, Palo Alto CA catalog 
#6488-1 or catalog #6373-1) was used for all mutagenesis reactions described. The 
selection oligo used in a given reaction is dependent on the restriction site present in the 

15 vector. All mutagenic primers were synthesized by standard synthetic chemistry. 

Resultant colonies for both types of reactions were Exoli strain JM109. Expression and 
purification of the mutant protein was done as detailed above. 

Construction of Tth DN RX HT H786A 

20 Site specific mutagenesis was performed on pTrc99G Tth DN RX HT DNA using 

the mutagenic primer 583-001-04: 5'-CAG GAG GAG CTC GTT GGC GAC CTG GAG 
GAG-3' (SEQ ID NO:21) to generate the H786A mutant enzyme (DNA sequence SEQ 
ID NO:22; amino acid sequence SEQ ID NO:23). 

25 Construction of Tth DN RX HT (H786A/G506K/Q509K) 

Starting with the mutant Tth DN RX HT H786A, generated above, site specific 
mutagenesis was done using the mutagenic primer 604-022-02: 5-GGA GCG CTT GCC 
TGT CTT CTT CGT CTT CTT CAA GGC GGG AGG CCT-3' (SEQ ID NO:24) to 
generate this variant termed "Cleavase TthAKK", (DNA sequence SEQ ID NO:25; amino 

30 acid sequence SEQ ID NO:26). 
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Large Scale preparation of recombinant proteins 

The recombinant proteins were purified by the following technique which is 
derived from a Tag DNA polymerase preparation protocol (Engelke et al. y Anal. 
Biochem., 191 :396 [1990]) as follows. E. colt cells (strain JM 109) containing either 
5 pTrc99A TaqPol, pTrc99GTthPoI were inoculated into 3 ml of LB containing 100 mg/mi 
ampicillin and grown for 16 hrs at 37°C. The entire overnight culture was inoculated into 
200 ml or 350 ml of LB containing 100 mg/ml ampicillin and grown at 37°C with 
vigorous shaking to an A 60 o of 0.8. IPTG (1 M stock solution) was added to a final 
concentration of 1 mM and growth was continued for 16 hrs at 37°C. 

10 The induced cells were pelleted and the cell pellet was weighed. An equal 

volume of 2X DG buffer (100 mM Tris-HCi, pH 7.6, 0.1 mM EDTA) was added and the 
pellet was suspended by agitation. Fifty mg/ml lysozyme (Sigma) were added to 1 
mg/ml final concentration and the cells incubated at room temperature for 15 min, 
Deoxycholic acid (10% solution) was added dropwise to a final concentration of 0.2 % 

15 while vortexing. One volume of H2O and 1 volume of 2X DG buffer were added, and the 
resulting mixture was sonicated for 2 minutes on ice to reduce the viscosity of the 
mixture. After sonication, 3 M (NRO2SO4 was added to a final concentration of 0.2 M, 
and the lysate was centrifuged at 14000 x g for 20 min at 4°C. The supernatant was 
removed and incubated at 70°C for 60 min at which time 1 0% polyethylimine (PEI) was 

20 added to 0.25%. After incubation on ice for 30 min., the mixture was centrifuged at 

1 4,000 x g for 20 min at 4°C. At this point, the supernatant was removed and the protein 
precipitated by the addition of (NH^SC^ as follows. 

Two volumes of 3 M (NH 4 )2S0 4 were added to precipitate the protein. The 
mixture was incubated overnight at room temperature for 16 hrs centrifuged at 14,000 x g 

25 for 20 min at 4°C. The protein pellet was suspended in 0.5 ml of Q buffer (50 mM 

Tris-HCi, pH 8.0, 0.1 mM ODTA, 0.1% Tween 20). The suspended protein preparations 
were quantitated by determination of the A279 dialyzed and stored in 50% glycerol, 20 
mM Tris HC1, pH8.0, 50 mM KCI, 0.5% Tween 20, 0.5% Nonidet P-40, with 100 ug/ml 
BSA. 
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Cloning and Expression of AveFENl Nuclease 

A common method for cloning new members of a gene family is to run PCR 
reactions using degenerate oligonucleotides complementary to conserved amino acid 
5 sequences in that family, and then to clone and sequence the gene-specific PCR 

fragments. This sequence information can then be used to design sense and anti-sense 
gene-specific primers which can be used in PCR walking reactions (Nucleic Acids Res. 
1995a. 23(6)1087-1088) to obtain the remainder of the gene sequence. The sequences 
obtained from the sense and anti-sense PCR walks can then be combined to generate the 

10 DNA sequence for the entire open reading frame (ORF) of the gene of interest. Once the 
entire ORF is known, primers specific to both the 5* and the 3' end of the gene can be 
designed, and PCR reactions can be performed on genomic DNA to amplify the gene in 
its entirety. This organism-specific, amplified fragment can then be cloned into an 
expression vector, and via methods know in the art, and detailed below, the protein of 

1 5 interest can be expressed and purified. 

A. Degenerate PCR and PCR walking to obtain the sequence of the Ave FEN1 gene 

The protein sequences of the FEN1 genes from Pyrococcus furiosus (SEQ ID 
NO:27) Methanococcus jannaschii (SEQ ID NO:28), Methanobacierium 

20 thermoautotrophicum (SEQ ID NO:29), and Archaeoglobus fulgidus (SEQ ID NO:30) 
were aligned and blocks of conserved amino acids were identified. The conserved 
sequence blocks VFDG (valine, phenylalanine, aspartic acid, glycine), EGEAQ (glutamic 
acid, glycine, glutamic acid, alanine, glutamine), SQDYD (serine, glutamine, aspartic 
acid, tyrosine, aspartic acid), and GTDYN/GTDFN (glycine, threonine, aspartic acid, 
• 25 tyrosine or phenylalanine, asparagine) were chosen as sequences that would likely be 

present in all Archaeal FEN1 genes. Degenerate oligonucleotides were designed for each 
of these conserved sequence blocks. In addition to the FEN1 gene specific portion of the 
oligonucleotides a 15-nucleotide tail was added to the 5' end of the oligonucleotides to 
enable nested PCR. A different tail sequence was used depending on whether the 

30 degenerate oligonucleotide targets the sense or antisense strand of the FEN1 gene. 



51 



Printed from Mimosa 03/05/15 14:55:08 Page: 52 



WO 02/063030 



PCT/US02/03423 



Forward and/or reverse versions of the oligonucleotides were made and target the 
sense and antisense strands of the FEN1 gene respectively. The oligonucleotides are 
VFDG-Fwd (SEQ ID NO:3 1), EGEAQ-Fwd (SEQ ID NO:32) QDYD-Fwd (SEQ ID 
NO:33), EGEAQ-Rev (SEQ ID NO:34), SQDYD-Revl (SEQ ID NO:35), SQDYD-Rev2 
5 (SEQ ID NO;36), and GTDYN-Rev (SEQ ID NO:37). Two oligonucleotides were made 
for the SQDYD-Rev sequence because serine is encoded by 6 different codons. For use 
in PCR, the SQDYD-Revl and SQDYD-Rev2 oligonucleotides were mixed in a ratio of 
1 :2. For the QDYD-Fwd oligonucleotide, the requirement for mixing was avoided by 
targeting only the last four amino acids of the conserved SQDYD sequence. The 

10 GTDYN-Rev oligonucleotide also recognizes the sequence GTDFN since the codons for 
tyrosine and phenylalanine share 2 of 3 nucleotides. 

First, genomic DNA was prepared from 1 vial of the live bacterial strain as 
described below. All bacterial strains were obtained from the DSMZ (Deutsche 
Sammlung von Mikroorganismen und Zellkulturen, Acidianus arnbivalens- DSM # 

1 5 3772). When the cells were lyophilized, they were resuspended in 200 ul of TOE (10 
mM TrisHCL, pH 8.0, 1 mM EDTA, 100 mM NaCl). When the cells were in liquid 
suspension, they were spun down at 20,000 x G for 2 minutes and the cell pellets were 
resuspended in 200 jil of TNE. 20 \xl of 20% SDS (sodium dodecylsulfate) and 2 |xl of 1 
mg/ml proteinase K were added and the suspension was incubated at 65 °C for 30 

20 minutes. The lysed cell suspension was extracted in sequential order with buffered 

phenol, 1:1 phenol: chloroform, and chloroform. The nucleic acid was precipitated by 
the addition of on equal volume of cold 100% ethanoi. The nucleic acid was pelleted by 
spinning at 20,000 x G for 5 minutes. The nucleic acid pellet was washed with 70% 
ethanoi, air dryed and resuspended in 50 u,i of TE (10 mM TrisHCL, pH 8.0, 1 mM 

25 EDTA). The final DNA pellet was re-suspended in 50 |il of TE (10 mM Tris HC1, pH * 
8.0, 1 mM EDTA). 

Both reactions of the nested PCR were done using the Advantage cDNA PCR kit 
(Clontech) according to manufacturer's instructions using a final concentration of 1 u.M 
for all oligonucleotides. The first reaction is done in a 20 \il volume with one of the 6 
30 possible combinations of forward and reverse degenerate oligonucleotides, and includes 
either 1 uJ of the genomic DNA preparation described above. The cycling conditions 
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were 20 cycles of 95°C for 15 seconds, 50°C or 55°C for 15 seconds/and 68°C for 30 
seconds. The second reactions utilize primers that have the same sequence as the 5* tail 
sequence of the degenerate oligonucleotides described above. The two primers are 203- 
01-01 (SEQ ID NO:38) and 203-01-02 (SEQ ID NO:39). The second reaction is carried 
5 out exactly as described for the first reaction, except 30 cycles are done instead of 20 and 
the reaction volume is 25 ul. Following the second PCR, 5 ul of the reaction were loaded 
on a 2% or 4% agarose gel and the DNA was visualized by ethidium bromide staining. 
The expected product sizes based on the previously identified FEN1 sequences for all 
primer pairs are as follows: VFDG-Fwd and EGEAQ-Rev; 275 base pairs, VFDG-Fwd 

1 0 and SQDYD-Rev; 325 base pairs, VFDG Fwd and GTD YN-Rev; 5 1 0 base pairs, 

EGEAQ-Fwd and SQDYD-Rev; 100 base pairs, EGEAQ-Fwd and GTD YN-Rev; 290 
base pairs, QDYD-Fwd and GTD YN-Rev; 230 base pairs. The primer pair, VFDG-Fwd 
and EGEAQ-Rev was able to generate a correctly sized DNA product for all samples 
attempted. The primer pair, VFDG-Fwd and GTD YN-Rev was able to generate a 

1 5 correctly sized DNA product for most of the DNA samples attempted. 

When a DNA product of the expected size was made by the degenerate PCR, that 
DNA fragment was isolated and cloned into pGEM-T Easy (Promega) using the pGEM-T 
Easy ligation kit according to the manufacturer's instructions. The DNA sequence was 
determined and the sequence was used to generate sense and antisense genome walking 

20 oligonucleotides for cloning the remainder of the FEN1 gene. The oligonucleotides were 
designed according to the parameters of the GenomeWalker kit (Clontech) which was 
used prepare the various genomic DNA samples for the genome walking PCR reactions. 

The genomic DNA was randomly amplified using a random 12-mer 
oligonucleotide. One hundred- |il PCR reactions were set up with the Advantage cDNA 
. 25 PCR kit (Clontech) and contained 10 ul of genomic DNA and 15 uM random 12-mer 
oligonucleotide. 50 cycles were carried out with the following parameters: 95°C for 30 
seconds, 50°C for 30 seconds, 68°C for 5 minutes. After the PCR reactions were 
complete, amplified DNA was purified with the High Pure PCR Product Purification kit 
(Boehringer Mannheim). The purified DNA was eluted into a total of 200 ul of 10 mM 

30 TrisHCL, pH 8.5. 
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The genome walking protocol consists of 3 steps. First, a genomic DNA sample 
is cut with 5 different blunt-end restriction enzymes in 5 separate reactions. Second, the 
cut DNA is ligated to an adapter which serves as a tag sequence and also is designed to 
prevent background amplification. Third, the ligated DNA is amplified with a gene- 
5 specific primer and a primer with the same sequence as a portion of the adapter sequence. 

50ul restriction digests contained 30 ul of randomly amplified genomic DNA and 
the Dra I restriction enzyme. After 4 hours at 37°C, the cut DNA was purified with either 
GENECLEANII (Bio 101) or QIAEX II (Qiagen) according to manufacturer's 
instructions. DNA was eluted into 10 ul of 1 0 mM TrisHCl, pH 8.5 in either case. 5.6uJ 

1 0 of this cut DNA was used in 10 ul ligation reactions containing 6uM Genome Walker 
adapter. Reactions were carried out at room temperature overnight followed by heating 
at 70°C for 10 minutes to inactivate the T4 DNA ligase. The ligation reactions were then 
diluted with 70 ul of TE (10 mM TrisHCl, pH 8.0, 1 mM EDTA). 

One fil of the diluted ligation mix was used in 25 ul PCR reactions with 0.2 uM 

15 gene-specific primer and 0.2 uM primer AP-1 (Clontech) which has the same sequence 
as the 5' portion of the Genome Walker adapter. Ten reactions were done for each DNA 
sample. Five antisense walk PCR reactions (for the 5 different restriction enzymes used 
to cut the genomic sample) were done using the sense gene-specific primer and five sense 
walk PCR reactions were done using the antisense gene-specific primer for each DNA 

20 sample. The cycling parameters were as recommended by the Universal Genome 

Walking kit (Clontech) and were as follows: 7 cycles of 94°C for 25 seconds and 72°C 
for 3 minutes, 32 cycles of 94°C for 25 seconds and 67°C for 3 minutes, followed by 
67°C for 7 minutes. . 

The Archaeoglobus veneficus (Ave) genome walks were done as follows. The 

25 primary antisense primer was Ave 34AS (SEQ ID NO:40) and the primary sense primer 
was Ave 65S (SEQ ID NO:41). Nested PCR reactions were done using the nested primer 
AP-2 and either the nested antisense primer Ave 3 2 AS (SEQ ID NO:42) or the nested 
sense primer Ave 67S (SEQ ID NO:43). 25-ul nested reactions were done as descibed 
above for the primary PCR walk reactions. The primary reactions were diluted 1 :50 in 

30 H2O and 0.5 ul of those dilutions were added to the nested PCR reactions. The cycling 
parameters for the nested PCR reactions were as recommended by the Universal Genome 
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Walking kit (Clontech) and are as follows: 5 cycles of 94°C for 25 seconds and 72°C for 
3 minutes, 20 cycles of 94°C for 25 seconds and 67°C for 3 minutes, followed by 7 
minutes at 67°C. The nested antisense PCR reaction on Stu I cut Ave genomic sample 
generated a 1 kilobase D*NA product which was cloned into pGEM-T Easy (Promega) 
5 following manufacturer's instructions and sequenced. The nested sense PCR reaction on 
Eco RV cut Ave genomic sample generated a 1.1 kilobase product which was cloned into 
pGEM-T Easy (Promega) following manufacturer's instructions and sequenced. 

Cloning of Ave FEN-1 nuclease I into an expression vector 
1 0 PCR reactions were performed using the primers designed above and genomic 

DNA from the organism of interest. The PCR products were gel purified and then cut 
with restriction endonucleases corresponding to the sites incorporated in the PCR 
primers. The cut PCR products were then purified away from the smaller digest 
fragments and these cut products were cloned into an expression vector. In some cases, 
1 5 this was the final step of the cloning process, prior to transformation and protein 
expression/purification. In some cases a fifth step was needed. In some cases, a 
mutagenesis step had to be performed to remove any nucleotides that were incorporated 
into the ORF as a result of primer sequences required for cloning. 

Finally, a bacterial host (e.g., E. coli JM109) was transformed with the expression 
20 vector containing the cloned FEN-1, and protein expression and purification were done as 
detailed below. 

The cloning of a FEN-1 from Archaeaglobus veneficus (Ave) was performed as 
described above using the DSM # 1 1 195 genomic DNA and PCR primers Ave 5' - 3' 
TAACGAATTCGGTGCAGACATAGGCGAACTAC (SEQ IDNO:44) and 
25 Ave 3' - 5' CGGTGTCGACTCAGGAAAACCACCTCTCAAGCG (SEQ ID NO;45). 
The mutagenic oligonucleotide used was Ave ARI - 5* 

CACAGGAAACAGACCATGGGTGCAGACATAGGCGAAC (SEQ ID NO:46). The 
open reading frame (ORF) encoding the^ve FEN-1 endonuclease is provided in SEQ ID 
NO:47; the amino acid sequence encoded by this ORF is provided in SEQ ID NO:48. 

30 
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Large Scale Preparation of Recombinant Ave FEN-1 Protein 

Ave FEN-1 protein was purified by the following technique, which is derived 
from a Tag DNA polymerase preparation protocol (Engelke et aL, Anal. Biochem., 
191:396 [1990]) as follows. £ coli cells (strain JM109) containing the construct 
5 described above were inoculated into 3 ml of LB (Luria Broth) containing 100 jig/ml 

ampiciilin and grown for 16 hrs at 37°C. The entire overnight culture was inoculated into 
200 ml or 350 ml of LB containing 100 jag/ml ampiciilin and grown at 37°C with 
vigorous shaking to an A 6 oo bf 0.8. IPTG (1 M stock solution) was added to a final 
concentration of 1 mM and growth was continued for 16 hrs at 37°C. 

10 The induced cells were pelleted and the cell pellet was weighed. An equal 

volume of 2X DG buffer (100 mM Tris-HCl, pH 7.6, 0.1 mM EDTA) was added and the 
pellet was resuspended by agitation. Fifty mg/ml lysozyme (Sigma, St. Louis, MO) was 
added to 1 mg/ml final concentration and the cells were incubated at room temperature 
for 15 min. Deoxycholic acid (10% solution) was added dropwise to a final 

1 5 concentration of 0.2 % while vortexing. One volume of H 2 0 and 1 volume of 2X DG 
buffer was added and the resulting mixture was sonicated for 2 minutes on ice to reduce 
the viscosity of the mixture. After sonication, 3 M (NH 4 ) 2 S0 4 was added to a final 
concentration of 0.2 M and the lysate was centrifuged at 1 4000 x g for 20 min at 4°C, 
The supernatant was removed and incubated at 70°C for 60 min at which time 10% 

20 polyethylimine (PEI) was added to 0.25%. After incubation on ice for 30 min., the 

mixture was centrifuged at 14,000 x g for 20 min at 4°C. At this point, the supernatant 
was removed and the FEN-1 protein was precipitated by the addition of (NH 4 )2S0 4 as 
follows. 

The FEN- 1 protein was precipitated by the addition of solid (NH 4 )2S0 4 to a final 
25 concentration of 3 M (-75% saturated). The mixture was incubated on ice for 30 min 
and the protein was centrifuged at 14,000 x g for 20 min at 4°C. The protein pellet was 
resuspended in 0.5 ml of Q buffer (50 mM Tris-HCl, pH 8.0, 0.1 mM EDTA, 0.1% 
Tween 20). The resuspended protein preparations were quantitated by determination of 
the A279 • 
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INVADER assay using charged-balanced probes 

This experiment utilized the following Cy3-labeled oligonucleotide: 5'-Cy3- 
AminoT-AminoT-CTnTCACCAGCGAGACGGG-3' (SEQ ID NO: 1; termed "oligo 
6 1 "). Oligo 6 1 was designed to release upon cleavage a net positively charged, labeled 
5 product. To test whether or not a net positively charged 5'-end labeled product would be 
recognized by the CLEAVASE enzymes in the INVADER-directed cleavage assay 
format, probe oligo 61 (SEQ ID NO:l) and INVADER oligonucleotide 67 (SEQ ID 
NO:2) were chemically synthesized on a DNA synthesizer (ABI 391) using standard 
phosphoramidite chemistries and reagents obtained from Glen Research (Sterling, VA). 

1 0 Each assay reaction comprised 1 00 fmoles of M 1 3mp 1 8 single stranded DNA, 1 0 

pmoles each of the probe (SEQ ID NO:l) and INVADER (SEQ ID NO:2) 
oligonucleotides, and 20 units of CLEAVASE A/G in a 10 ul solution of 10 mM MOPS, 
pH 7,4 with 100 mM KC1. Samples were overlaid with mineral oil to prevent 
evaporation. The samples were brought to 50°C, 55°C» 60°C, or 65°C and cleavage, was 

1 5 initiated by the addition of 1 fil of 40 mM MnCh- Reactions were allowed to proceed for 
25 minutes and then were terminated by the addition of 10 uJi of 95% formamide 
containing 20 mM EDTA and 0.02% methyl violet. The negative control experiment 
lacked the target M13mpl8 and was run at 60°C. Five microliters of each reaction were 
loaded into separate wells of a 20% denaturing polyacrylamide gel (cross-linked 29:1) 

20 with 8 M urea in a buffer containing 45 mM Tris-Borate (pH 8.3) and 1.4 mM EDTA. 
An electric field of 20 watts was applied for 30 minutes, with the electrodes oriented as 
indicated in Fig. 19B {i.e., in reverse orientation). The products of these reactions were 
visualized using the FMBIO fluorescence imager and the resulting imager scan is shown 
in Fig. 19B. 

25 Fig. 19A provides a schematic illustration showing an alignment of the 

INVADER (SEQ JD NO:2) and probe (SEQ ID NO:l) along the target M13mpl 8 DNA; 
only 53 bases of the M13mpl8 sequence is shown (SEQ ID NO:49). The sequence of the 
INVADER oligonucleotide is displayed under the M13mpl8 target and an arrow is used 
above the M13mpl8 sequence to indicate the position of the INVADER relative to the 

30 probe and target. As shown in Fig. 19A, the INVADER and probe oligonucleotides share 
a 2 base region of overlap. 
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In Fig. 19B, lanes 1-4 contain reactions performed at 50°C, 55°C, 60°C, and 
65°C, respectively; lane 5 contained the control reaction (lacking target). In Fig. 19B, the 
products of cleavage are seen as dark bands in the upper half of the panel; the faint lower 
band seen appears in proportion to the amount of primary product produced and, while 
5 not limiting the invention to a particular mechanism, may represent cleavage one 

nucleotide into the duplex. The uncleaved probe does not enter the gel and is thus not 
visible. The control lane showed no detectable signal over background (lane 5). As 
expected in an invasive cleavage reaction, the rate of accumulation of specific cleavage 
product was temperature-dependent. Using these particular oligonucleotides and target, 

10 the fastest rate of accumulation of product was observed at 55°C (lane 2) and very little 
product observed at 65°C (lane 4). 

When incubated for extended periods at high temperature, DNA probes can break 
non-specificaliy (i.e., suffer thermal degradation) and the resulting fragments contribute 
an interfering background to the analysis. The products of such thermal breakdown are 

1 5 distributed from single-nucleotides up to the full length probe. In this experiment, the 

ability of charge based separation of cleavage products (i.e., charge reversal) would allow 
the sensitive separation of the specific products of target-dependent cleavage from probe 
fragments generated by thermal degradation was examined. 

To test the sensitivity limit of this detection method, the target M13mpl 8 DNA 

20 was serially diluted ten fold over than range of 1 fmole to 1 amole. The INVADER and 
probe oligonucleotides were those described above (/.&, SEQ ID NOS:2 and 1, 
respectively). The invasive cleavage reactions were run as described above with the 
following modifications: the reactions were performed at 55°C, 250 rnM or 100 mM 
KGIu was used in place of the 100 mM KCl and only 1 pmole of the INVADER 

25 oligonucleotide was added. The reactions were initiated as described above and allowed, 
to progress for 12.5 hours. A negative control reaction that lacked added Ml 3ml 8 target 
DNA was also run. The reactions were terminated by the addition of 1 0 u,I of 95% 
formamide containing 20 mM EDTA and 0.02% methyl violet, and 5 ul of these mixtures 
were electrophoresed and visualized as described above. The resulting imager scan is 

30 shown in Fig. 20. 
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In Fig. 20, lane 1 contains the negative control; lanes 2-5 contain reactions 
performed using 100 mM KGlu; lanes 6-9 contain reactions performed using 250 mM 
KGlu. The reactions resolved in lanes 2 and 6 contained 1 fmole of target DNA; those in 
lanes 3 and 7 contained 100 amole of target; those in lanes 4 and 8 contained 10 amole of 
5 target and those in lanes 5 and 9 contained 1 amole of target. The results shown in Fig. 
20 demonstrate that the detection limit using charge reversal to detect the production of 
specific cleavage products in an invasive cleavage reaction is at or below 1 attomole or 
approximately 6.02 x 10 5 target molecules. No detectable signal was observed in the 
control lane, which indicates that non-specific hydrolysis or other breakdown products do 
1 0 not migrate in the same direction as enzyme-specific cleavage products. The excitation 
and emission maxima for Cy3 are 554 and 568, respectively, while the FMBIO Imager 
Analyzer excites at 532 and detects at 585. Therefore, the limit of detection of specific 
cleavage products can be improved by the use of more closely matched excitation source 
and detection filters. 

15 

EXAMPLE 3 

Examination Of The Effects Of A 5* Positive Charge On The Rate Of Invasive 
Cleavage Using The CLEAVASE A/G Or Pfu FEN- 1 Nucleases 

20 To investigate whether the positive charges on the 5' ends of probe 

oligonucleotides containing a positively charged adduct(s) have an effect on the ability of 
the CLEAVASE A/G or Pfu FEN-1 nucleases to cleave the 5' arm of the probe, the 
following experiment was performed. 

Two probe oligonucleotides having the following sequences were utilized in 

25 INVADER reactions: Probe 34-1 80-1 : (N-Cy3)T N H2T N H2CCAGAGCCTAATTTGCC 
AGT(N-fluorescein)A, where N represents a spacer containing either a Cy3 or 
fluorescein group (SEQ ID NOS:50 or 51, respectively) and Probe 34-180-2: 5-(N- 
TET)TTCCAGAGCC TAATTTGCCAGT-(N-fluorescein)A, where N represents a 
spacer containing either a TET or fluorescein group (SEQ ID NOS:52 or 53, 

30 respectively). Probe 34-1 80-1 (SEQ ID NO:50) has amino-modifiers on the two 5' end T 
residues and a Cy3 label on the 5* end, creating extra positive charges on the 5' end. 
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Probe 34-1 80-2 (SEQ ID NO:52) has a TET label on the 5' end, with no extra positive 
charges. The fluorescein label on the 3' end of probe 34-180-1 enables the visualization 
of the 3' cleaved products and uncleaved probes together on an acrylamide gel run in the 
standard direction (Le. t with the DNA migrating toward the positive electrode). The 5' 
5 cleaved product of probe 34-180-1 has a net positive charge and will not migrate in the 
same direction as the uncleaved probe, and is thus visualized by resolution on a gel run in 
the opposite direction (i.e.; with this DNA migrating toward the negative electrode). 

The cleavage reactions were conducted as follows. All conditions were 
performed in duplicate. Enzyme mixes for the Pfu FEN-1 and CLEAVASE A/G 

10 nucleases were assembled. Each 2 u.1 of the Pfu FEN-1 mix contained 100 ng of Pfu 
FEN-1 and 7.5 mM MgCl 2 . Each 2 \l\ of the CLEAVASE A/G nuclease mix contained 
26.5 ng of CLEAVASE A/G nuclease and 4.0 mM MnCl2. Four master mixes containing 
buffer, M13mpl8, and INVADER oligonucleotides were assembled. Each 7 \i[ of mix 1 
contained 5 fmol M13mpl8, 10 pmoles INVADER oligonucleotide 123 (SEQ ID NO:54) 

15 in 10 mM HEPES (pH 7.2). Each 7 of mix 2 contained 1 fmol M13mpl 8, 10 pmoles 
INVADER oligonucleotide 123 in 10 mM HEPES (pH 7.2). Each 7 uJ of mix 3 
contained 5 fmol M13mpl8, 10 pmoles INVADER oligonucleotide 123 in 10 mM 
HEPES (pH 7.2), 250 mM KGlu. Each 7 fxl of mix 4 contained 1 fmol Ml 3mpl 8, 1 0 
pmoles INVADER oligonucleotide 123 in 10 mM HEPES (pH 7.2), 250 mM KGlu. For 

20 every 7 ^1 of each mix, 10 pmoles of either probe 34-180-1 (SEQ ID NO:50) or probe 
34-180-2 (SEQ 1DN0:52) were added. The DNA solutions described above were 
covered with 10 u.1 of CH1LLOUT evaporation barrier and brought to 65°C. The 
reactions made from mixes 1-2 were started by the addition of 2 jal of the Pfu FEN-1 mix, 
and the reactions made from mixes 3-4 were started by the addition of 2 jil of the 

25 CLEAVASE A/G nuclease mix. After 30 minutes at 65°C, the reactions were terminated 
by the addition of 8 u.1 of 95% formamide containing 10 mM EDTA. Samples were 
heated to 90°C for 1 minute immediately before electrophoresis through a 20% 
denaturing acrylamide gel (19:1 cross-linked) with 7 M urea, in a buffer containing 45 
mM Tris-Borate (pH 8.3), 1 .4 mM EDTA and a 20% native acrylamide gel (29:1 

30 cross-linked) in a buffer containing 45 mM Tris-Borate (pH 8.3), 1 .4 mM EDTA. 
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The products of the cleavage reactions were visualized following electrophoresis 
by the use of a Hitachi FMBIO fluorescence imager. The resulting images are shown in 
Fig. 21 . Fig. 21 A shows the denaturing gel, which was run in the standard 
electrophoresis direction, and Fig. 21 B shows the native gel, which was run in the reverse 
5 direction. The reaction products produced by Pfu FEN-1 and CLEAVASE A/G 

nucleases are shown in lanes 1-8 and 9-1 6, respectively. -The products from the 5 fmol 
M13mpl8 and 1 fmol M13mpl8 reactions are shown in lanes 1-4, 9-12 (5 fmol) and 5-8, 
13-16 (1 fmol). Probe 34-180-1 is in lanes 1-2, 5-6, 9-10, 13-14 and probe 34-180-2 is in 
lanes 3-4, 7-8, 11-12, 15-16. 

10 The fluorescein- labeled 3' end fragments from all cleavage reactions are shown in 

Fig. 21 A, indicated by a "3"' mark at the left. The 3 nt 5' TET-labeled products are not 
visible in this Figure, while the 5 1 Cy3-labeled products are shown in Fig. 2 IB, 

The 3' end bands in Fig, 21 A can be used to compare the rates of cleavage by the 
different enzymes in the presence of the different 5' end labels. It can be seen from this 

1 5 band that regardless of the amount of target nucleic acid present, both the Pfu FEN-1 and 
the CLEAVASE A/G nucleases show more product from the 5* TET-labeled probe. With 
the Pfu FEN-1 nuclease this preference is modest, with only an approximately 25 to 40% 
increase in signal. Jn the case of the CLEAVASE A/G nuclease, however, there is a 
strong preference for the 5* TET label. Therefore, although when the charge reversal 

20 method is used to resolve the products, a substantial amount of product is observed from 
the CLEAVASE A/G nuclease-catalyzed reactions, the Pfu FEN- 1 nuclease is a preferred 
enzyme for cleavage of Cy3-labeled probes, 

EXAPMLE 4 

25 Manual Coupling of the 5 1 Phosphoramidite (Positively Charged Phosphoramidite 
or Neutral Phosphoramidite) to Solid Support 

This example demonstrates one means by which a phosphoramidite with a 
30 positive or neutral charge can be coupled to an oligonucleotide on a solid support. The 
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coupling method described below is provided by way of example and not by way of 
limitation; other coupling methods may also prove to be effective. 

A l A inch plug of Pyrex Brand Fiber Glass Wool (Aldrich, Cat# Z 25,289-0) was 
tightly packed into a 2.5 ml gas-tight Hamilton syringe (VWR, Cat. # 90168) using first a 
5 pasteur piptte or like device to drive the glass wool to the bottom of the syringe, followed 
by compression with the syringe plunger. The plunger was removed and approximately 
40 mg of dry Control Pore Glass (CPG) support, coupled with oligonucleotide sequence 
SEQ ID NO:55 (still protected with the dimethoxy trity 1 [DMT] moiety at the 5' end) was 
added to the syringe, on top of the packed glass wool. The amount of the CPG added 

10 varies with the batch of CPG synthesized, and is specifically dependent on the amount of 
oligonucleotide loaded onto the solid support- The plunger was reinserted and depressed 
to pack the CPG coupled DNA onto the glass wool. A 5-inch, 18 gauge Luer Lock 
needle was secured to the syringe, and all reagents were drawn into the reaction vessel 
(the syringe) via the needle. The plunger remained in the syringe for the rest of the 

1 5 procedure. 

Once the plunger was reinserted, the CPG-oligonucleotide complex was washed 3 
times with methylene chloride (stored over 3-angstrom pore size, activated, Molecular 
Sieves [Aldrich, Cat # 20,858-2]) by drawing 1 mi into the syringe via the needle, 
inverting 3-5 times and ejecting the wash solution by depressing the plunger. 

20 Reactions were then washed with 1 ml of deblock (dichloroacetic acid [a 1 5% 

solution in methylene chloride was special ordered from Glen Research] diluted to 3% in 
methylene chloride) to remove the DMT as described above. Washes were performed 
until the orange color generated by the free trityl groups was completely gone, with a 
maximum incubation time of 1 minute for all 3 washes. 

25 After the final wash, the reactions were neutralized with three 1 ml washes of a 

1:1 mixture of acetonitrile:pyridine, stored over calcium hydride. This was followed by 
8, 2 ml washed with acetonitrile stored over calcium hydride. 1 .5 ml of the appropriate 
phosphoramidite solution (either 50-100 mM of the positively charged or the neutral 
phosphoramidite in acetonitrile, stored over calcium hydride) and 1 ml of activator 

30 (0.25M 5-ethylthio-lH-tetrazole [Glen Research, Cat J 30-3 140) in anhydrous 

acetonitrile over activated Molecular Sieves) was drawn up into the syringe. The needle 
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was sealed using a silicone stopper (Aldrich, CatJ Zl 6608-1) and rocked gently, by hand 
for 20 minutes at room temperature. 

After the 20 minute incubation, the solution was ejected and six 1ml washes with 
acetonitrile stored over calcium hydride were done as described above. Two ml of 
5 oxidizer (0.02M iodine in tetrahydrofuran/pyridine/water [Glen Research, CatJ 40- 
4330]) was drawn into the syringe, the needle was again sealed with a silicone stopper 
and the reaction was rocked gently at room temperature for 3 minutes. This was followed 
by 4, 1ml acetonitrile (stored over calcium hydride) washes and 2, 1ml 
acetonitrile:pyridine (1:1 mixture, stored over calcium hydride) washes. 1 ml of Cap B 

1 0 solution (10% n-methylimidazole in a solution of 8: 1 tetrahydrofuran and pyrimidine [PE 
Biosystems]) and 1 ml of Cap A (THF/Acetic Anhydride, 9:1, PE Biosystems) were 
drawn into the syringe, the needle was capped and the reaction was rocked gently for 3 
minutes at room temperature. This was followed by six 1 ml washes with 
acetonitrile:pyridine (1:1 mixture, stored over calcium hydride) and five 1 ml washes 

15 with methylene chloride stored over activated, Molecular Sieves. 

For subsequent manual couplings, the above procedure can be repeated, starting 
with the deblock washes. For subsequent automated couplings, the support can be 
transferred to a synthesis column and attach to synthesizer. If the reaction is complete, 
the 5* dimethoxy trityl can be removed by washing with deblock, neutralizing with 3three 

20 1 ml acelonitrile:pyrimidine washes, and eight 2 ml acetonitrile washes, as described 
above. 

Deprotection Protocol: 

The dried support (CPG) carrying the newly modified oligonucleotide was 
25 transferred to a 4ml glass vial (Wheaton, 224801) with a TEFLON-Iined cap (Wheaton 
240408). 1ml of concentrated ammonium hydroxide (EM Sciences AX 1303-13) was 
added and the reaction was incubated overnight at room temperature. The mixture was 
then Filter through a 0.2 urn TEFLON Acrodisc filter (Gelman, 4423T) using a 1 ml 
disposable syringe (B-D, 309602), and finally dried to completion in a speedvac. 

30 
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EXAMPLE 5 



Synthesis of Positively Charged Phosphoramidite 



5 



1) 



Preparation of mono-DMT protected 4,4 , -timethylcne(A/.y(l 
pipcrdine ethanol)): 



10 grams (33.4 rnmol) of 4,4 , -timethylene(bis-(l-piperdine ethanol)) [Aldrich, 
Cat. # 12,122-3] and 1.46 ml (8.4mmol) of N-N-di-isopropylethylamine (Aldrich, Cat. # 
38,764-9] were combined in a 250-ml round-bottom flask (such as ChemGlass, Cat J 
CG- 15 06). A magnetic stir bar was added and stirring was initiated at medium speed. 

1 0 2.84 grams (8.4mmol) of 4,4'-dimethoxytrityl chloride (Aldrich, CatJ 10,001-3) was 
added as a solid, slowly (over the course of about 1 minute) with constant stirring. The 
flask was covered with a rubber septum and the reaction was incubated at room 
temperature with continued stirring, until complete, for about 1 hour. 
The reaction was monitored by thin layer chromatography (EM Science 60F254 silica 

1 5 plates from VWR, Cat.# 5715-7) using standard methods known in the an until the 
starting material, 4,4 , -dimethoxytrityl chloride, was no longer detected on the 
chromatography plate. The reaction products were then filtered and purified by column 
chromatography using a 4.5 by 25 cm glass chromatography column (with glass frit and 
TEFLON stopcock) and 70-230 mesh, 60 angstrom silica gel (Aldrich, CatJ 28,862-4). 

20 The running solvent was a solution of 5% methanol, 5% triethylamine and 90% 

methylene chloride. Chromatography was performed by standard methods known in the 
art. The product was a yellow oil, with a yield of approximately 4.8 grams (95%) with an 
Rf value of 0.55 as determined by TLC. TLC was performed using EM Science 60F 2 54 
silica plates (VWR, CatJ 5715-7), in a running buffer of 5% triethylamine/95% dioxane. 



2) Preparation of phosphoramidite: 

1.3 grams (2.2mmol) of mono-DMT protected 4 > 4'-timethylene(bis-(l-pipirdine 
ethanol)) synthesized in the above reaction was co-evaporated in a 250 ml round bottom 
flask, three times with 20 ml of acetonitrile. A Buchi Rotovapor with dry ice/alcohol 
30 condenser, (Buchi, model number R-l 14) was used for the evaporation, and the mixture 
was dried to completion for each co-evaporation. 



25 
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The dry product was then dissolved in 12 ml of methylene chloride followed by 
an addition of 0.85 ml (2.7mmoI) of 2-cyanoethyl tetraisopropyl phosphorodiamidite 
(Aidrich, Cat.# 30,599-5). 122mg (1.7mmol/4mI) of tetrazole dissolved in 3 mi dry 
acetonitrile was added with vigorous swirling, and the reaction vessel was secured in a 
5 cork ring, taped to a vortexer and vortexed at medium speed, at room temperature, for 1 .5 
hours. The reaction was monitored by TLC and was complete when mono-DMT 
protected 4,4 , -timethyiene(bis(l-piperdine ethanol)) was no longer visible by TLC. 

25 ml of methylene chloride were added to increase the volume, and the entire 
reaction was transferred to a 100ml separatory funnel. An equal volume (approximately 

10 40 ml) of a 5% sodium bicarbonate: 1% triethylamine solution was added, the mixture 
was shaken for 15 seconds and allowed to equilibrate. The lower, organic phase was 
drained from the funnel and retained. The upper aqueous phase was discarded, the 
organic phase was transferred back to the separatory funnel and the wash was repeated 
for a total of three sodium bicarbonate/triethylamine washes. 

15 The organic phase was transferred to an Ehrlenmeyer flask and solid magnesium 

sulfate (approximately 20g) was slowly added, with swirling, until no clumping of the 
solids was detected. The magnesium sulfate was filtered via a Buchner filter funnel with 
ground glass adaptor (Chemglass, Cat J CG-1406) and the solution was concentrated and 
co-evaporated twice with 20 ml of acentonitrile on a Buchi Roto vapor in a tared, round- 

20 bottom flask. The amount of dry product was determined by mass, and then re-dissolved 
in acetonitrile to a final concentration of approximately 1 50~200mg/ml. Several granules 
of calcium hydride were added. The dissolved product was then dispensed (2 ml/bottle) 
into amber glass vials (Wheaton, Cat. # 224754) and dried, first via a water aspirator until 
the product appears as an extremely viscous oil, and then overnight under vacuum in a 

25 glass dessicator (VWR) containing phosphorous pentoxide (Aidrich, Cat. # 29822-0) and 
DR1ERITE (VWR; Cat. # 22891-040). The yield was approximately 1.6 grams (92.1%) * 
with an Rf value of 0.7 as determined by TLC. TLC was performed using pre-run EM 
Science 6OF254 silica plates (VWR, Cat.# 5715-7), in a running buffer of 5% 
triehtylamine/95% dioxane. 

30 
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EXAMPLE 6 
Synthesis of Neutral Phosphoramidite 

1) Synthesis of mono-DMT protected N-methyldithanolamine: 

5 8.3 grams (70,0 mmol) of N-methyldiethanolamine, 2.2 ml (12.6 mmol) of di- 

isopropyl ethylamine and 100 ml of acetonitrile were combined in a 250-ml round- 
bottom flask (such as ChemGiass, CatJ CG-1506), A magnetic stir bar was added and 
stirring was initiated at medium speed. 4 grams (1 1.8 mmol) 4,4-dimethoxytrityl 
chloride (Aldrich, Cat.# 10,001-3) was added as a solid, slowly (over the course of about 

10 1 minute) with constant stirring. The flask was covered and the reaction was incubated at 
room temperature with continued stirring, until complete, for about 1 hour. The reaction 
was monitored by thin layer chromatography (EM Science 6OF254 silica plates from 
VWR, CatJ 5715-7) using standard methods known in the art. The reaction is complete 
when the starting material, N-methyldiethanolamine is no longer detected on the 

15 chromatography plate. 

After the 1 hour incubation, the reaction products were concentrated using the 
Buchi Rotovapor, and then dissolved in 50 ml of methylene chloride. The dissolved 
product was transferred to a 250 ml glass separatory funnel and washed 3 times with 50 
ml of 5% sodium bicarbonate and once with saturated sodium chloride, as described 

20 above. 

The reaction products were then filtered and purified by column chromatography 
using a 4.5 X 25 cm glass chromatography column (with glass frit and TEFLON 
stopcock) and 70-230 mesh, 60 angstrom silica gel (Aldrich, Cat.# 28,862-4). The 
running solvent was a solution of 5% methanol, 5% triethylamine and 90% methylene 
. 25 chloride. Chromatography was performed by standard methods known in the art. The 
product was a yellow oil, with a yield of approximately 4.8 grams (95%), with an Rf 
value of 0.55 as determined by TLC. TLC was performed using pre-run EM Science 
6OF254 silica plates (VWR, CatJ 5715-7), in a running buffer of 5% triehtyl amine/95% 
dioxane. 

30 
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2) preparation of phosphoramidite: 

1 ,3 grams (3.2mmol) of mono-DMT protected N-methyldiethanolamine, 
synthesized in the ahove reaction, was co-evaporated in a 250 ml round bottom flask, 
three times with 20 ml of acetonitrile (ACN). A dry ice/alcohol, Bttchi Rotovapor, 
5 (Buchi, model number R- 1 1 4) was used for the evaporation, and the mixture was dried to 
completion for each co-evaporation. 

The dry product was then dissolved in 12.6 ml of methylene chloride followed by 
and addition of 1 .2 ml (3.8 mmol) of 2-cyanoethyl tetraisopropyl phosphorodiamitide 
(Aldrich, CatJ 30,599-5). 173mg (2.5mmol/4ml) of tetrazole/acetonitrile was added 

1 0 with vigorous swirling, and the reaction vessel was secured in a cork ring, taped to a 
vortex and vortexed at medium speed, room temperature, for 3 hours. 25 ml of 
methylene chloride were added to increase the volume, and the entire reaction was 
transferred to a 100 ml separatory funnel. An equal volume (approximately 40 ml) of a 
5% sodium bicarbonate: 1 % triethylamine solution was added, the mixture was shaken for 

1 5 3-5 seconds and allowed to equilibrate, and the lower, organic phase was drained from 
the funnel and saved. The upper aqueous phase was discarded, the organic phase was 
transferred back to the separatory funnel and the wash was repeated, for a total of three 
sodium bicarbonatc/triethylamine washes. The organic phase was transferred to an 
Ehrlenmeyer flask and solid magnesium sulfate (approximately 20g) was slowly added, 

20 with swirling, until no clumping of the solids was detected. The magnesium sulfate was 
filtered out via a Bttchner filter funnel with ground glass adaptor (Chemglass, Cat.# CG- 
1406), and the solution was concentrated and co-evaporated twice with 20 ml of 
acentonitrile in a Buchi Rotovapor in a tared, round-bottom flask. The amount of dry 
product was determined by mass, and as then re-dissolved in acetonitrile (and several 

25 granules of calcium hydride) to a final concentration of approximately 150-200mg/ml. 
The dissolved product was then aliquoted (2 ml/bottle) into amber glass bottles 
( Wheaton) and dried, first via a water aspirator until the product appears as an extremely 
viscous oil, then overnight under vacuum in a glass dessicator (VWR) containing 
phosphorous pentoxide (Aldrich) and DRIERITE (VWR). The yield was approximately 

30 1 .9 grams (97.0%) with an Rf value of 0.8 as determined by TLC. TLC was performed 
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using pre-run EM Science 6OF254 silica plates (VWR, Cat.# 5715-7), in a running buffer 
of 5% triehtylamine/95% dioxane. 



EXAMPLE 7 

5 Synthesis of the 1,6 Hcxanediol H-Phosphonate 

1) Synthesis of the DMT protected 1,6-HcxanedioI 

Three grams (25mmoI) of 1 ,6-hexanediol (Aldrich, Cat.24,01 1-7) was dissolved 
in 120mL of anhydrous tetrahydrofuran (THF) (Aldrich, Cat.# 18,656-2). 1 .5 mL ( 1 .lg, 

1 0 88mmol) of di-isopropylethylamine (Aldrich, Cat J 38,764-9) were added, and the 
resulting mixture (protected from moisture) was stirred at room temperature for 15 
minutes. Three grams (9 mmol) of Dimethoxytrityl Chloride (DMTCi) was then added, 
and the solution was incubated, with stirring for two hours at room temperature. The 
resulting mixture was concentrated under reduced pressure via a Btichi Rotovapor 

15 (Buchi, model R-l 14), and the concentrated material was filtered and purified via column 
chromatography using silica gel column (70-230 mesh) / Mexane: Ethyl Acetate 1:1 by 
standard methods known in the art. Fractions containing isolated material (as determined 
by TLC; Rf - 0,3) were combined and concentrated. The yield was 77% (2.9g; 7mmoi). 

20 2) Synthesis of the DMT-l,6-Hexancdiol H-phosphonatc 

All reactions described below were performed under nitrogen in a system 
protected from moisture. 

a) Synthesis of the Phosphorus Triimidazolide ( PTm 3 ) 

25 4.3 mL (5.9g; 43mmoI) of Phosphorus trichloride (PC1 3 , Aldrich, Cat.#31,011-5) was 

dissolved in lOOmL of anhydrous THF at 0°C with gentle stirring. The temperature was. 
held at 0°C, and stirring was continued while, over a period of 10 minutes, 18.8mL (18g, 
129mmol) of Trimethylsilylchloride (Me 3 Si-Cl, Aldrich, CatJC7,285-4) dissolved in 
40mL of anhydrous THF was added to the reaction. After the addition of Me3$i-CI, the 

30 reaction mixture was incubated at 0°C for 30 minutes with continued stirring, and then at 
room temperature for 30 minutes with continued stirring. Finally, the reaction mixture 
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was concentrated under reduced pressure, protected from moisture, to 75% of its original 
volume. 



b) Synthesis of H-Phosphonate 

5 5.9g (14mmol) of the DMT-protected 1,6-hexanediol synthesized above was 

dissolved in lOmL of anhydrous acetonitrile, and was then added slowly (over a period of 
about 5 minutes, with constant stirring) at room temperature, to the phosphorus 
triimidazolide (PIm3) solution. The reaction was incubated at room temperature with 
stirring for 4 hours, and then transferred to a separatory runnel containing 100 ml of 

1 0 water, 50 g of ice, 20ml of Triethylamine and 50ml of methylene chloride. The organic 
and aqueous phases were allowed to separate, and the organic (lower) fraction was 
isolated. The extraction was repeated until no DMT-containir.g material was present in 
the organic fraction as determined by TLC, described previously. Combined organic 
fractions were dried over magnesium sulfate for lhr, followed by concentration under 

1 5 reduced pressure. The concentrated product was purified by column chromatography 

using Silica gel 70-230 mesh, methylene chloride / methanol 10% / Triethylamine 5% (R f 
=0.5). 

Product containing fractions were combined and concentrated. Yield: 5.8g 
(61%). The final concentrated product was then co-evaporated 5 times with 50 ml of 
20 anhydrous Acetonitrile, dried under high vacuum for 1 8 hours and dissolved in 1 8mL of 
Pyridine/Acetonitrile 1:1. Activated Molecular sieves (3 angstrom) were added. 

EXAMPLE 8 

Manual introduction of modifications into CRE Probes using H-phosphonate 
25 chemistry 

A 2.5 ml gas-tight Hamilton syringe (VWR, Cat.#90168) was loaded (as detailed 
in Example 4) with 1 umol CPG support (DMT on) coupled with a DNA CRE probe (for 
example, SEQ ID NO:55). 
30 To remove the DMT, the CPG/oligonucleotide complex was washed twice (as 

described in Example 4) with 1 ml of methylene dichioride, then washed for 1 minute 
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with 5 ml of 3% dichloroacetic acid in methylene dichloride. The reaction was then 
washed 10 times with 1 ml of anhydrous acetonitrile/pyridine 1:1. After the final wash, 
one of 5 different H-phosphonate moieties (the 1,6 hexanediol H-phosphonate 
synthesized in Example 7; dA-H-Phosphonate, dC-H-Phosphonate, dG-H-Phosphonate, 
5 or dT-H-Phosphonate [Glen Research, CatJ 10-1200-05, 10-1210-05, 10-1220-05, 10- 
1230-05]) was added as follows. 1 ml of H-phosphonate solution (concentration: 50 - 
150umol/mL) and ImL of the trimethylacetyl chloride solution in anhydrous 
acetonitrile/pyridine 1:1 (concentration; 100 - 250umol/mL) were drawn into the syringe, 
the needle was sealed and the reaction was incubated at room temperature with gentle 

10 shaking for 5-10 minutes. The syringe contents were expelled, and 6, 1ml 

acetonitrile/pyridine 1 : 1 washes were done. After the last wash, 0.1-0.2g of a primary or 
secondary amine (for example N,N-dimethyiethylenediamine, Aldrich, Cat.#Dl 5,780-5) 
in 1 mL of anhydrous pyridine, followed by 0.5 mL of anhydrous carbon tetrachloride 
were drawn into the syringe and incubated at room temperature, with gentle shaking for 

15 5-15 minutes. The syringe contents were expelled, and six 1 ml anhydrous 

acetonitrile/pyridine 1 : 1 washes were done. This was followed by six 1 ml methylene 
chloride washes; a 1 minute wash with 5 ml 3% dichloroacetic acid/methylene dichloride; 
ten 1 ml washes with anhydrous acetonitrile/pyridine 1:1 and six 1 ml washes with 
methylene chloride. 

20 The dried support (CPG) was transferred to a 4ml glass vial (Wheaton, 224801) 

with a TEFLON-lined cap (Wheaton 240408). 1 ml of concentrated ammonium 
hydroxide (EM Sciences AX 1303-13) was added and the reaction was incubated for 12 
hours at 55°C. After the cleavage and deprotection was completed, the product 
containing ammonia solution was concentrated under reduced pressure and subjected to 

25 ion exchange HPLC or reverse phase HPLC purification. 

For all HPLC purifications, the Hitachi HPLC (Interface mode!# D-7000; pump 
model# 7100; diode array detector model# L-7455) system, and standard methods known 
in the art were used. The specific conditions used for the Reverse Phase HPLC 
purification were: C-18 Dionex analytical column (4.6x250mm) with a flow rate of 1 

30 mi/min, starting with 100% buffer A (0. 1 M TEA A) and 0% buffer B (acetonitrile), and 
transitioning to buffer B at a rate of 1 % buffer B per minute. Fractions were collected 
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and analyzed via mass spectrometry by methods known in the art, to identify the 
complete product. 

The specific conditions used for the ion exchange HPLC purification were: 
Amersham Pharmacia Biotech HR 10/10 15Q IE column (10X1 00mm) with a flow rate 
5 of 5 ml/min. Buffer A (20mM sodium perchlorate, 20 mM sodium acetate, 1 0% 
acetonitrile, pH 7.35) and Buffer B (600 mM sodium perchlorate, 600 mM sodium 
acetate, 1 0% acetonitrile, pH 7.35) were used in a gradient beginning and ending at 
5%A/95%B, with a gradient increase of approximately 65%B per minute. Fractions were 
collected and analyzed by mass spectrometry by methods known in the art, to identify the 
10 desired product. 

EXAMPLE 9 

Effect of tag modifications on the INVADER Assay reaction 

1 5 In this example, oligonucleotide probes containing positively charged tags at their 

5' ends were tested in INVADER assay reactions, and the reaction turnover rates using 
two, differently modified probe oligonucleotides were compared. Here, turnover rate is 
defined as the number of cleavage events per target per unit time. The turnover rates 
were determined as described in (Lyamichev, el ai. t Biochemistry 39:9523 [2000]). 

20 The first oligonucleotide probe, 5'-Cy3-AminoT-AminoT-ACG CCA CCA GCT- 

3' (SEQ ID NO:56, termed 203-85-5), utilized AminoT modifications such as those 
described in Example 2. 

The second oligonucleotide probe, 5*-V-(Hex)-Cy3-CGC TGT CTC GCT-3* (SEQ 
ID NO:57, termed 490-52), was synthesized using the H-phosphonate modification V- 

2.5 (Hex), depicted in Figure 11. The INVADER-directed cleavage of probes 203-85-5 and 
490-52 was designed to release net positively charged Cy3-labeled products 5'-Cy3- 
A'minoT-AminoT-3 ! and 5*-V-(Hex)-C-3\ respectively. The first product is generated by 
enzymatic cleavage after AminoT, whereas the second product is produced by the 
cleavage after a natural base C. 

30 The INVADER oligonucleotide 5'- GCT CAA GGC ACT CTT GCC C-3* (SEQ 

ID NO:58, termed 203-85-4) and the target oligonucleotide 5'-ATG ACT GAA TAT 
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AAA CTT GTG GTA GTT GGA GCT GGT GGC GTA GGC AAG AGT GCC TTG 
ACG ATA-3* (SEQ ID NO:59, termed 203-85-3) used with the probe 203-85-5 were 
synthesized using phosphoramidite reagents obtained from Glen Research and standard 
phosphoramidite chemistries known in the art. The underlined nucleotides denote 2'-0- 
5 methyl modifications. The INVADER and target oligonucleotides used with the probe 
490-52 were combined into the single molecule 5'-bio.tin-TTT TTT TTT AAT TAG GCT 
CTG GAA AGA CGC TCG TGA AAC GAG CGT-3* (SEQ ID NO:60, termed ITS). Ail 
oligonucleotides were gel purified and quantitated as described (Lyamichev, etal y 
supra). 

1 0 The INVADER assay reactions utilizing the AminoT-modified probe 203-85-5 

were performed as follows: 1 0 u.1 reactions were prepared and contained (final 
concentrations): 2 uM amino modified probe (203-85-5), 1 uM INVADER 
oligonucleotide 203-85-4 (SEQ ID NO: 58), 1 nM target oligonucleotide 203-85-3 (SEQ 
ID NO:59), 32 nM AfuFENl CLEAVASE enzyme, 10 mM MOPS, pH 7.5, and 4 mM 

15 MgCl 2 . 

The INVADER reactions utilizing probe 490-52 (2 u,M) were prepared as above, 
except I nM of the ITS oligonucleotide (SEQ ID NO:60) was used, and served as both 
the INVADER oligonucleotide and the target oligonucleotide. 

The reactions were assembled on ice in 200u,l thin wall PCR tubes (Dot 

20 Scientific, Cat.#620-PCR), overlaid with 10 u.1 of Chill-out liquid wax (MJ Research) and 
transferred to a Mastercycler heating block (Eppendorf, Cat.# 5331 000.045). The 
reactions were incubated for 60 minutes at 55.3, 57.7, 60.5, 63.4, 66.2, and 68.7°C using 
a temperature gradient of 62±10°C (controlled by the heating block). The reactions were 
stopped after 1 hour with the addition of 1 0 uL of 95% formamide containing 20 mM 

25 EDTA and 0.02% methyl violet. 

One microliter aliquots of each reaction were loaded onto each of two 200x200x1 
mm slabs of 15% denaturing polyacrylamide gel (crosslinked 19:1) with 7 M urea in a 
buffer containing 45 mM Tris borate, pH 8.3 and 1 mM EDTA. An electric field of 20 
watts was applied for 30 minutes with the positive electrode connected either to the top 

30 buffer reservoir (reverse orientation) or bottom reservoir (normal orientation). The net 
positively charged products generated in the course of the INVADER reactions were 
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detected by gel electrophoresis in the reverse orientation and the uncleaved probes of the 
same samples were analyzed by separation in the normal orientation. The intensities of 
bands corresponding to the products and uncleaved probes were measured using FMBIO- 
100 fluorescence imager (Hitachi, Alameda, CA) equipped with 532-nrn laser and 585- 
5 nm filter at 1 0% sensitivity level. 

The measured turnover rates for probes 203-85-5 <SEQ ID NO:56) and 490- 
52,(SEQ ID NO:57) as a function of temperature are shown in Fig. 22. The probe 490-52 
which was synthesized using H-phosphonate chemistry to introduce the modification V- 
(Hex), has approximately 10-fold greater turnover rate than the AminoT modified probe 
10 203-85-5. 

EXAMPLE 10 

Detection of Specific Cleavage Products by Charge Reversal 

1 5 This example demonstrates that a CLEAVASE enzyme that recognizes cleavage 

structures containing RNA targets (CLEAVASE TthAKK) also recognizes and cleaves 
structures containing RNA targets and the above-described positively charged probe 
oligonucleotides. In this example, 5 different, modified probe oligonucleotides were used 
in an INVADER reaction to detect human MCP1 in vitro transcripts. Each probe 

20 oligonucleotide was designed to release a labeled product with a net positive charge such 
that the cleavage products could be delected using charge reversal methods. 

The five different, 5'-end modified, Cy 3 -labeled probe oligonucleotides tested 
were: 5*- V-(HEX)-Cy3 -CTTCGGAGTTTGGG-NH 2 -3' (SEQ ID NO:61; termed "oligo 
PI"), 5'- V-(dA)-Cy3 -CTTCGGAGTTTGGG-NH2-3' (SEQ ID NO:62; termed "oligo 

25 P2"), 5'- V-(dC)-Cy3 -CTTCGGAGTTTGGG-NH 2 -3' (SEQ ID NO:63; termed "oligo 
P3"), 5'- V-(dG)-Cy3 -CTTCGGAGTTTGGG-NH2-3' (SEQ ID NO:64; termed "oligo 
P4"), and 5'- V-(dT)-Cy3 -CTTCGGAGTTTGGG-NH2-3' (SEQ ID NO:65; termed 
"oligo P5 M ) (Fig. 23). The 5 ! modifications were synthesized as described previously, and 
all 5 of the above oligonucleotides and the INVADER oligonucleotide, lnvl 5'- 

30 GGGTTGTGGAGTGAGTGTTCAAGTA-3'(SEQ ID NO:66) were chemically 
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synthesized on a DNA synthesizer (ABI 391) using standard phosphoramidite chemistries 
and reagents obtained from Glen Research (Sterling, VA). 

All probe oligonucleotides were purified by Anion exchange HPLC. There was 
one major and one or more minor peaks observed with this purification method. The 
5 material from the major (first) peak was used in all experiments described below. 

In vitro transcripts were synthesized as follows. The human Ubiquitin cDNA was 
isolated from a first-strand human liver cDNA library (Clontech Cat #7407-1) by PCR 
using a universal 5' primer (API, S'CCATCCTAATACGACTCACTATAGGGC-S', SEQ 
ID NO:67) provided with the library and a Ubiqui tin-specific 3' primer (5*- 

1 0 CTCATACAGTTAC1T GTCTTC-3', SEQ ID NO:68). PCR reactions were performed 
with an error-correcting polymerase mixture from Clontech (Cat # 8417-1) according to 
manufacturer's instructions. The expected size of the PCR products was 500 bases. PCR 
products were gel purified on 1 % agarose gel run in 0.5X TBE. The gel was Stained in 
lOu-g/ml ethidium bromide, visualized under UV light, the appropriately sized band was 

1 5 excised and the DNA recovered with a Ql Aquick Gel Extraction Kit (Qiagen Cat 

#28706). The gel-purified fragment was then cloned into the pCR2.1-TOPO cloning 
vector (Invitrogen, Cat. # K4500-0J) by methods known in the art. Positive clones were 
selected and insert identity was confirmed by DNA sequencing. The positive plasmids 
were transformed into TOPI 0 cells (Invitrogen). Cells were grown and plasmid isolated 

20 by methods well known in the art of molecular biology. The same 5' and 3* primers used 
above were then used in PCR reactions to generate templates for use in in vitro 
transcription reactions. In vitro transcriptions were done performed using the Ambion T7 
MEGAshortscript RNA Transcription Kit (Ambion, Cat.# 1354) according to the 
manufacturer's instructions. The resulting human ubiquitin transcript is SEQ ID NO:69. 

25 Note that the use of the API 5' primer includes the T7 RNA polymerase promoter, which 
is necessary for the generation of in vitro transcripts. All transcripts used in the following 
reactions contained tRNA (Sigma) at 20ng/ uJ as carrier. 

HMCP1 in vitro transcripts were synthesized as follows. The human Monocyte 
Chemoattractant Protein- 1 (hMCP-1) cDNA was obtained from 10 ug/ml Con-A 

30 (concanavalin-A) and PHA (phytohemagglutinin) stimulated human PMBC's (Peripheral 
Blood Mononuclear Cells) total RNA. Total RNA was isolated from lxlO 7 cells with 
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TRJzol®Reagent (Gibco BRL Cat #15596) according to the manufacturing protocol. 500 
ng of total RNA was used for reverse transcription using the GeneAmp RNA PCR kit 
(Perkin Elmer cat #N808-0017) for the generation of the cDNA. This RT-PCR was 
performed using a gene specific 5' primer that also contained the T7 RNA polymerase 
5 promoter site (5*- 

GGAATACGACTCACTATAGGGAAAGTCTCTGCCGCCCTTCTGTGCCTGCTGC- 
3', SEQ ID NO:70) and a 3' hMCP-specific primer (5'- 

AATAGTTACAAAATATTCATTTCCACAATAA-3\ SEQ ID NO:71). The 665 base 
fragment was re-amplified using the same PCR primers and Taq DNA Polymerase 

1 0 . (Perkin Elmer Cat. #N808-01 52). The fragment was column purified using the Wizard® 
PCR Preps DNA Purification System (Promega Cat # A7170) and quantitated by O.D. 26 o 
measurement. In vitro transcription was performed using 600 ng of the purified PCR 
product in the Ambion T7 MEGAshortscript RNA Transcription Kit (Ambion Cat #1354) 
according to the manufacturer's protocol. The hMCP in vitro transcript generated (SEQ 

1 5 ID NO:72) was 647 nt long. 

The solution of the in vitro transcript was mixed with an equal volume of loading 
dye (95% Formamide, lOmM EDTA, Methyl violet dye), heat denatured at 90°C for 3 
minutes and then loaded on a 6% denaturing (19:1 cross-linked) with 7 M urea 
acrylamide gel run in 0.5X TBE. After the electrophoresis, one of the glass plates was 

20 removed and the gel was covered with plastic wrap. The gel then was placed wrap-side- 
down on the TLC (DC Fertigplatten Kieselgel 40 F254 Merck, Art 5634) plate and the 
other glass plate was removed. The RNA bands were visualized in the dark room by 
shining a hand-held UV light source (254nM; short wave) on the surface of the gel. The 
nucleic acid will appear as dark bands while the TLC plate will appear green. The bands 

25 corresponding to the RNA were excised with a razor blade and eluted in TE (10 mM Tris, 
0.1 mM EDTA) containing 0.3 M sodium acetate at 37°C for 4 hours. The in vitro 
transcript was ethanol precipitated at -20°C over night (alternatively, precipitation at -70° 
C for lhour is also sufficient) and pelleted at 14,000 rpm for 30 min at 4°C. The pelleted 
nucleic acid was then washed with 70 % ethanol and spun again for 5 minutes. After the 

30 ethanol was discarded, the pelleted nucleic acid was dried under vacuum and resuspended 
in RNase-free H 2 Q (USB Cat #US70783). The concentration of the in vitro transcript 
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was determined by OD260. All dilutions of the in vitro transcript used in the reactions 
were prepared in 20ng/uJ of yeast tRNA (Sigma Cat # R5636). 

Five sets of reactions were done, one for each different probe oligonucleotide. A 
negative (no-target) control containing 100 ng of yeast tRNA was performed for each 
5 reaction set. Each 10 |il reaction was prepared at room temperature as follows. Five 
different master mixes were prepared, one for each probe. Each mix comprised (final 
concentration): 10 mM MOPS, pH 7.5, 100 mM KCl, 0.05% Tween, and 0.05 % Nonidet 
NP40, 1 2.5 mM MgS0 4 , 5 pmoles of INVADER oligonucleotide (SEQ ID NO:66) and 
20ng of CLEAVASE TthAKK enzyme. Finally, 10 pmoles of one of the probes (SEQ ID 

1 0 NOS:61 , 62, 63, 64 or 65) were added for a final volume of 10 ul per reaction/per master 
mix. The master mixes were vortexed briefly and 5 u.1 of each was transferred to the 
appropriate reaction vessel (200 ul thin wail PCR tubes, Dot Scientific, Cat. #620-PCR), 
followed by the addition of 5 ul (containing 0, 0.1, 1 or 10 fmoles) of human MCP1 in 
vitro transcript. 1 00 ng of yeast tRNA (Sigma) was used as a negative control. Samples 

1 5 were pipetted up and down 3 times to mix. The samples were then overlaid with 1 Oul 
colored Chili out 14 liquid wax (MJ Research) to prevent evaporation and incubated at 
63 °C for 60 min. Reactions were terminated by the addition of 50 u.1 of 95% formamide 
containing 10 mM EDTA. 

Samples were run on a 15% denaturing acrylamide gel (19:1 cross-linked) with 7 

20 M urea, in a buffer containing 45 mM Tris-Borate (pH 8.3), ImM EDTA. The gel was 
pre-run, with the electrodes in the normal orientation prior to loading. The samples were 
heated to 90°C for 1 minute immediately before loading, and 2 |il were loaded per well. 
An electric field of 20 watts was applied for 30 minutes with the electrodes in the normal 
orientation. The products were visualized following electrophoresis with a Hitachi 

25 FMBIO fluorescence imager with 585-nM filter at 20 % sensitivity. The gel was then 
replaced on the running apparatus, and fresh buffer was added to the reservoirs. The 
electrodes were then placed in the reverse orientation, the gel was pre-run and loaded as 
above. The gel was run for 1 hour in the reverse orientation, and products were 
visualized as above. The resulting images are shown in Fig. 24. Fig. 24A shows the 

30 denaturing gel, which was run in the standard electrophoresis direction, and Fig. 24B 
shows the denaturing gel, which was run in the reverse direction. Probe V-(HEX) panel 
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A; probe V-(dA) panel B; probe V-(dC) panel C; probe V-(dG) panel D; and probe V- 
(dT) panel E. 



5 EXAMPLE 11 

Effects Of a 5 f Positive Charge on Cleavage Rate using CLEAVASE 
TthAKK Enzyme 

The previous example demonstrated the ability of the CLEAVASE TthAKK ' 

10 enzyme to recognize and cleave a cleavage structure containing an RNA target and a 

positively charged probe oligonucleotide. This example tests the effect of the positively 
charged probes on cleavage rates. 

All 5 of the positively charged probe oligonucleotides described in Example 10 
were tested against a 5' fluorescein labeled "control" probe oligonucleotide (SEQ ID 

15 NO:73; 5* fluorescein phosphoramidite from Glen Research). Both the positively 

charged and the control probe were designed to detect the same sequence, so are identical 
in the analyte specific region. The difference between the fluorescein labeled and the 
CRE-V labeled probes include the charge difference at the 5' end, and the length of the 
cleaved products, or 5' flap. The 5' flap of the positively charged probes is 1 base, while 

20 the control probe yields a 3 base, 5' flap. 

Reactions were performed as described in Example 10, using the hMCPl in vitro 
transcripts as target. Only one target level was used to test the cleavage rate for each 
probe oligonucleotide. Each reaction received either 1 fmole of the hMCPl in vitro 
transcript with 100 ng of yeast tRNA as carrier; 100 ng of yeast tRNA also served as a 

25 negative control. Reactions containing target were done in quadruplicate, while the 
tRNA control reactions were done singly. 

Turnover rates were determined as described in Lyamichev, et aL 9 supra, and are 
shown graphically in Fig. 25. The rates ranged from 2-to 9 cleavage events/target/minute 
with P3 (SEQ ID NO:63) showing the highest rate among the positively charged probes. 

30 The average cleavage rate of the fluorescein labeled probe was 12 cleavage 
events/target/minute. 
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EXAMPLE 12 

Examination of the Rate of Background Accumulation With 5' Positively Charged 

Probe Oligonucleotides 

A key advantage to using positively charged probe oligonucleotides is the ability 
5 to completely separate signal (e.g., the single base flap carrying the positively charged 
signal molecule) from any other aberrant reaction products or uncleaved probes using 
simple, reverse polarity gel electrophoresis, as described and detailed in the above 
examples. This experiment confirms that background cleavage products (aberrant 
cleavage, or thermodegradation products) will not migrate in the reverse polarity gel, 

1 0 even if the reaction is incubated with large amounts of target for an extended period of 
time, allowing for greater certainty and simplicity in data interpretation. 

The probe oligonucleotide used was P2 (described in Experimental Example 10, 
SEQ ID NO:62) and the INVADER oligonucleotide used was Invl (SEQ ID NO:66), 
also described in Example 10. The reaction conditions and gel based separation method 

15 were performed as described in 1 0. Reactions were performed with 0 (100ng/5 u.1 of 
tRNA as a negative control; background estimate), 0.01, 0.1 and 1 fmole of hMCPl in 
vitro transcript in a lOul reaction volume. Reactions were assembled as described in 
Example 10, and incubated for 1, 2, 4, 8 and 24 hours at 63°C. Reaction products were 
separated in normal or reverse polarity gels, as described in Example 10, and were 

20 analyzed based on the intensities from the Hitachi FMBIO scanner images and software, 
also described in Example 1 0. The results are shown graphically in Fig. 26. Fig. 26A 
represents the results of the denaturing gel, which was run in the standard electrophoresis 
direction, and Fig. 26B represents the results of the denaturing gel, which was run in the 
reverse direction. 

25 

EXAMPLE 13 

Detection of an RNA target using multiple, positively charged probes. 

The previous experiments have demonstrated that the positively charged probes 
cleaved in a structure specific manner by the CLEAVASE enzyme, can be used to detect 
30 RNA targets, and, in certain detection platforms, can be analyzed such that the signal to 
background ratio is superior to "normal/* negatively charged probe oligonucleotides. 



78 



Printed from Mimosa 03/05/15 14:56:23 Page: 79 



WO 02/063030 PCT/US02/03423 

The present experiment demonstrates that the cleavage products of different, 5' 
positively charged probes can be distinguished (based on the different mass to charge 
ratios), even when used in the same reaction. 

The oligonucleotides used in this experiment, the reaction conditions, gel-based 
5 separation and the analysis were conducted as described in Example 1 0, except that 2 
pmoles of each of 4 different probes [PI, P2, P4, and P5] were used, and the target levels 
were 0 (100 ng of tRNA only), 0.1, 1 and 30 fmoles of hMCPl in vitro transcript. Two 
u.1 of each reaction was loaded on the gel in reverse polarity and separated as described. 
The resulting image is shown in Figure 27. All cleavage products have a net 
10 positive charge. The mobility of the cleaved products from probe oligonucleotides PI , 
P2 and P4 were easily separated on the gel due to the differences in size (molecular 
weight) between them. In contrast, the cleaved products from the P5 probe 
oligonucleotide were barely distinguishable from the P4 products; the size and charge of 
these products are very similar. This demonstrates that a preferred, multiplex 
15 embodiment utilizes probes whose cleaved products can be easily distinguished in the 
detection system of choice. 

EXAMPLE 14 

Human MCP1 and Human Ubiquitin in vitro Transcript Detection in a Cascade 
20 Reaction with Positively Charged Tags 

In this example, a two-step, sequential invasive cleavage reaction is used to detect 
both hMCPl and hUbiquitin in vitro transcripts, in a true, multiplex reaction (both targets 
are detected in the same reaction). The positively charged probes (termed reporter 

25 oligonucleotides, or reporter-labeled oligonucleotides in this example) are used in the 
second step of the sequential invasive cleavage reaction, as shown in Fig. 28A and B. 
The added amplification provided by the cascading INVADER scheme yields greater 
sensitivity and lower limits of detection, important if target levels are limiting. 

The mechanism of the sequential invasive cleavage reaction is as follows. The 

30 primary INVADER and probe oligonucleotides (those which hybridize to the target) are 
unlabeled and, when hybridized to the appropriate target sequence, form the overlapping 
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structure recognized by the CLEAVASE enzyme (Fig. 28A). The enzyme cuts the 
structure and frees the 5' flap. The flap then acts as an INVADER oiigo for the 
secondary reaction. The secondary reaction comprises 3 different oligonucleotides: 1) a 
flap-reporter bridging oligonucleotide that has adjacent regions complementary to both 
5 the 5 ! flap and the reporter- labeled, secondary probe oligonucleotide; 2) a reporter- 
labeled, secondary oligonucleotide, complementary to a portion of the bridging 
oligonucleotide, and 3) the INVADER oligonucleotide, which is the 5' flap from the 
primary reaction, and which is complementary to a portion of the bridging 
oligonucleotide. When the overlapping structure forms in the secondary reaction, the 

1 0 enzyme cleaves the 5* flap from the reporter-labeled oligonucleotide, generating 
detectable signal with a positive charge. 

In the secondary reaction, the 5'-flaps of the uncleaved probe molecules can 
compete with the released 5'~flaps for hybridization to the flap-reporter bridging oligo, 
thus decreasing signal generation in the secondary reaction. To avoid this competition, 

15 the uncleaved probe is sequestered after the primary incubation by the addition of a 
complementary oligonucleotide called an "ARRESTOR oligonucleotide." The 
ARRESTOR oligonucleotide is fully complementary to the target-specific region of the 
probe, and partially extends into the 5'-flap region; thus, it does not interfere with the 
binding of the 5'-flap to the flap-reporter bridging oligonucleotide. ARRESTOR 

20 oligonucleotides thus promote more effective signal generation in the secondary reaction 
by preventing interactions between uncleaved probes and flap-reporter binding 
oligonucleotides. All of the bases of the ARRESTOR oligonucleotide are 2* O-methyl- 
modifled, making the ARRESTOR oligonucleotide resistant to cleavage by the 
CLEAVASE enzyme. 

25 The tag used for the hMCPl secondary, reporter probe oligonucleotide was 5' 

V(dC)-Cy3 (Fig. 28A), while the hUbiquitin secondary, reporter probe oligonucleotide 
incorporated the 5 r V(dG)-Cy3 tag (Fig. 28B). These tags were chosen since, as 
demonstrated in Example 10 and shown in Figure 24 they are easily separated and 
identified due to the difference in mass-to-charge ratio between them. The 

30 oligonucleotides used for the detection of Human MCP1 in vitro transcripts were: the 
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primary probe oligonucleotide 5 , -CCGTCACGCCTCClTCGGAGTTTGGG-NH 2 -3' 
(SEQ ID NO:74), the primary INVADER oligonucleotide Invl (SEQ ID NO:66), the 
arrestor oligonucleotide S' AACCCAAACTCCGAAGGAGGCGTG -NHaO' (SEQ ID 
NO:75), the flap-reporter bridging oligonucleotide 5' 
5 GCGCAGTGAGAATGAGGAGGCGTGACGGT^NH2-3' (SEQ ID NO:76), and the 
reporter-labeled secondary probe oligonucleotide 5'-V(dC)~ Cy3 

CTC ATTCTC AGTGCG-3 1 (SEQ ID NO:77). The underlined bases denote 2'-0-methyl 
modifications. The oligonucleotides used for the detection of Human Ubiquitin in vilro 
transcripts were: the primary probe oligonucleotide 5'- 

1 0 AACGAGGCGCACCTTTACATTTTCTATCGT- NH 2 -3' (SEQ ID NO:78), the primary 
INVADER oligonucleotide S'-CCTTCCTTATCCTGGATCTTGGCAO' (SEQ ID NO: 
79, the ARRESTOR oligonucleotide 5 'ACGATAGAAAATGTAAAGGTGCGC NH 2 -3* 
(SEQ ID NO:80), the flap-reporter bridging oligonucleotide 5 1 - 
CGGAAGAAGCAAGTGGTGCGGCT C GTTAA -NH 2 -3' (SEQ ID NO:81, and the 

1 5 secondary reporter- labeled probe oligonucleotide 5 , -V(dG)-Cy3 CACTTGCTTCCTCC-3 , 
(SEQ ID NO:82). Three control reaction sets were included in this experiment: 1) 
control reaction using a non-cascading reaction (basic INVADER, described in Example 
10) to detect hMCPl transcripts, using the 5 # V(dC) probe (P3, SEQ ID NO:63) and the 
INVADER oligonucleotide Invl (SEQ ID NO:66) also used in Example 10; 2) a control 

20 reaction set designed to demonstrate the lack of cross reactivity between the 

oligonucleotides used for the detection of one target and the signal generating mechanism 
of the other target; and 3) a control set in which all primary and secondary components 
were present as for the multiplex reaction, but only one secondary reporter 
oligonucleotide was present: either for the detection of hMCPl or hUbiquitin. 
- 25 The primary reaction volumes were 1 0 p,l and secondary reaction volumes were 

15 uJ. Each assay reaction comprised of 0, 1, 10 100 or 1000 amoles human ubiquitin 
and/or MCP1 in vitro transcript (SEQ ID NOS: 69 or 72, respectively) for the single and 
multiplex reactions, 10 pmoles each of the primary probe oligonucleotides (SEQ ID 
NOS:71 and 75) 5 pmoles of each primary INVADER (SEQ ID NO:66 and 79) 

30 oligonucleotides, and 20ng of CLEAVASE TthAKK enzyme in a 10 jal solution of 10 
mM MOPS, pH 7.5, 1 00 mM KC1., 0.05% Tween, 0.05 % Nonidet NP40, 1 2.5mM 
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MgS0 4 . Reactions were performed by dispensing 5 fal of the appropriate primary 
reaction mix (buffer, enzyme, MgS0 4 , primary probe oligo and primary INVADER 
oligonucleotide) into the reaction vessel (low profile MJ Research, Inc. Cat.#MLL9601) 
and then adding 5 |il of target, or tRNA as the negative control. Samples were overlaid 
5 with colored Chill-out 14 liquid wax (MJ Research) to prevent evaporation and incubated 
at 60°C for 60 minutes. 

After the primary reactions were completed, 5u.l of the appropriate secondary 
reaction mixture (2.5 pmoles of appropriate flap-reporter bridging oligonucleotide [SEQ 
ID NOS: 76 and/or 8 1] 40 pmoles of ARRESTOR oligonucleotide [SEQ ID NOS:75 
10 and/or 80] and 10 pmoles of each secondary reporter-labeled oligonucleotide [SEQ ID 
NOS: 77 and 82] such that the final concentration of the secondary reaction was 10 mM 
MOPS, pH 7.5, 0.05% Tween, 0.05 % Nonidet NP40, 20 mM MgS0 4 ) were added to 
each reaction and incubated at 60°C for Ihour. 

The reactions were stopped by addition of 50ul of stop buffer containing 95% 
1 5 formamide and 1 0 mM EDTA. Two jil of each reaction were analyzed by both normal 
and reverse polarity gel electrophoresis. Samples were heated to 90°C for 1 minute 
immediately before electrophoresis through a 15% denaturing acrylamide gel (19:1 
cross-linked) with 7 M urea, in a buffer containing 45 mM Tris-Borate (pH 8.3), 1 .4 mM 
EDTA. An electric field of 20 watts was applied for for 1 hour in reverse orientation. 
20 The gel was scanned on the Hitachi FMBIO-100 fluorescence imager with 585-nM filter 
at 20 % sensitivity. 

Images of the reverse polarity gel are shown in Fig. 29, panel A: basic non- 
cascading reaction; panel B: multiplex, cascading reaction; panel C: cascading reaction 
with MPC1 reporter oligo; and panel D: cascading reaction with Ubiquitin reporter oligo. 



30 
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EXAMPLE 15 

Detection of Human MCPJ and Ubiquitin Transcripts from Cell Lysates with a 

Multiplex CRE Format 

5 The previous experiment demonstrated that the positively charged probe 

oligonucleotides can be used to detect in vitro transcripts in a cascading, invasive 
cleavage reaction, and that they function well in a true, multiplex reaction format. The 
present experiment demonstrates that the assay format described in Example 14 can be 
used to detect both the hMCPl and hUbiquitin transcripts from cell lysates, and from 

1 0 preparations of total cellular RNA. 

Cell lysates and total RNA were prepared from MG 63 cells (ATCC # CRL- 
1 427). The cells were grown according to instructions supplied by ATCC, and by 
standard methods known in the art. Cells used for the lysate preparation were grown in 
96 well fiat bottom tissue culture plates, while cells used for the total RNA preparation 

15 were grown in 10 cm tissue culture dishes. Prior to either procedure, cells were 
stimulated with both human tumor necrosis factor-a (TNF-a [Calbiochem, Cat.# 
654205]) and human interleukin-ip (IL-lp [Calbiochem, Cat.# 407615]). The final 
concentration in the induction medium was lOng/ml for both TNF-a and for IL-l p. 

Cell lysates were prepared as follows: Prior to lysis, cells were washed 2X with 

20 200p.l of phosphate buffered saline (PBS), Cells were then lysed by adding 30 uX of cell 
lysis buffer (20 mM Tris pH 7.5, 5 mM MgCL 2 , 20 ng/fil tRNA, 0.5% Nonidet NP-40) 
and incubating at room temperature for 5 minutes. 20 jil of each lysate was transferred 
into a 96-weIl microplate (MJ Research). The plate was covered to prevent volume loss 
due to evaporation, and cellular nucleases were inactivated by heating the microplate at 

25 80°C fori 5 minutes prior to the INVADER reaction. 

Total RNA was isolated with Trizol reagent (Gibco BRL, Cat J 1 5596) from 
stimulated and unstimulated cells following the manufacturer's protocol. Cells were 
grown in 10 cm plates to approximately 6-7x1 0 6 cells/plate and treated for 2 hours with 
TNF-a and IL-ip, both at lOng/ml. The RNA was then suspended in RNAse free 

30 distilled water (USB Cat # US70783) and stored at -70°C. 
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In the following experiment 3 different INVADER assay formats were used. The 
multiplex, cascading reaction format was used to detect each analyte; the non-multiplex, 
(single) cascading reaction format was also used to detect each analyte; and a basic 
INVADER (non-cascading) reaction format was used for hMCPl detection only. All of 
5 the formats used the positively charged, labeled probes of the present invention as the 
detection moiety. Detection of each analyte was performed using total RNA, cell lysates 
and in vitro transcripts. 

Target levels for the single and multiplex cascade reactions, as well as for the 
basic, non-cascading INVADER reaction were: either 0 or 1 fmole of in vitro transcript 

10 in 5 \x\; 5 u.1 of cell lysate (approximately 2000 cells); or 50 ng of total RJNA in 5 uJ. 

The multiplex, cascading reaction were prepared as described in Example 14 and 
included all the oligonucleotides required to detect both targets. The cascading reactions 
performed to detect only one target were prepared as described in Example 14, except the 
oligonucleotides required for the detection of only one of the targets (either hUbiquitin or 

1 5 hMCPl) were added, not both. The basic, non-cascading 1NVADE& reactions were 
prepared as described in Example 10. 

The products of the INVADER reaction were separated on reverse polarity gel 
electrophoresis (positively charged cleavage products) or normal polarity gel 
electrophoresis (full length probes) and the gels were scanned on the Hitachi FMBIO-100 

20 fluorescence imager with 585 -nM filter at 20% sensitivity. 

Images of the normal and reverse polarity gels are shown in Figures 30A and B. 
The normal polarity images are shown as panels below the reverse polarity panels, with 
the lanes showing the products of the same reactions aligned vertically. Lanes 1-4 show 
results with either 0 (noted by the - symbol) or 1 fmole (noted by the + symbol) of in 

25 vitro transcript; lanes 5-8 show results using cell lysates (approximately 2000 cells per 

reaction) with either no cellular stimulation (noted by the - symbol) or 4 hours of cellular 
stimulation (noted by the + symbol) prior to the lysate preparation; lanes 9-12 show 
results using approximately 50 ng per reaction total RNA with either no cellular 
stimulation (noted by the - symbol) or with 4 hours of cellular stimulation (noted by the 

30 + symbol) prior to the total RNA preparation. Lanes 1-3, 5-7 and 9-1 1 show the results 
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of the cascading reaction; lanes 4, 8 and 12 show the results of the basic, non-cascading 
reaction. 



EXAMPLE 16 

5 Detection of Positively Charged, Labeled Oligonucleotide Tags by Capillary 

Electrophoresis 

Capillary electrophoresis (CE) is an extremely useful tool that can be used for fast, 
and effective separation of a wide variety of molecules, including DNA oligonucleotides 

10 (Baker, D.R. (1995) Capillary Electrophoresis, Wiley Interscience Publications, New 

York, USA), herein incorporated by reference in its entirety, CE offers the advantages of 
high sensitivity, ease of use, and low cost. It provides a fast and effective method for the 
detection of dye-labeled tags, using, for example laser induced fluorescence. Most of the 
commercially available CE instruments are also capable of charge reversal 

1 5 electrophoresis (CRE). Therefore, it was decided to employ CRE as a method to detect 
the positively-charged tags generated by the invasive cleavage reactions, described and 
demonstrated above. 

An interesting feature of the different, positively charged tags (e.g., products of an 
INVADER assay reaction using CRE probes) is their low charge-to-mass ratio. The 

20 oligonucleotide-positive charge tags used in this study have a net charge of +1 and a mass 
slightly higher than that of a DNA nucleotide base. Thus, it would be extremely difficult 
to use the conventional CE-based DNA separation methods (such as gel -filled capillaries) 
because the injection times required for appropriate sample delivery would result in line 
broadening and poor sensitivity. 

25 Therefore, other CE techniques, such as hydrodynamic injection and sample 

stacking using charged zone electrophoresis (CZE), and micellar electrokinetic capillary 
electrophoresis (MECC or MEKCC) (Weinberger, R. (1993) Practical capillary 
electrophoresis, Academic Press, San Diego, U.S.A, herein incorporated by reference in 
its entirety) were employed to achieve the sensitivity and resolution required for 

30 separation of the positively charged, tagged oligonucleotides. 
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The following examples demonstrate optimization of experimental conditions for 
MECC-CE based separation of the positively charged tagged oligonucleotides generated 
by INVADER reactions. 

5 Optimizations of CRE Conditions: Detection of Positively Charged Oligonucleotide 
Tags 

In order to determine the optimal conditions for running CRE experiments using 
capillary electrophoresis employing sample stacking and micellar electrokinetic capillary 

1 0 electrophoresis (MECC), a number of variables were tested. The variables were 

determined to have the greatest effect on the resolution and sensitivity of detection of 
INV ADER-cleaved tag products. The CRE probes were synthesized as described in 
Examples 4-6. The tags are depicted top to bottom in Figure 17, and are called Tag 6, 
Tag 3, Tag 5, Tag 4, Tag 1 and Tag 2, respectively. The INVADER assay reactions used 

1 5 in these to release these tags were conducted using the oligonucleotides, target DNAs, 
probes and conditions described in Example 18. 

Unless otherwise indicated, all experiments described below were performed on a 
Beckman-Coulter P/ACE MDQ capillary electrophoresis system equipped with a YAG 
532 nm laser (JDS Uniphase) and a 580 ± 10 nm emission filter (Andover Corporation, 

20 Cat.#580FS 10-1 2.5). 100 micron eCAP (Beckman-Coulter) capillary (10 cm to window) 
was run at 25°C with a constant separation voltage of 25 kV, using a separation buffer of 
50 mM Bis-Tris borate pH 6.5. The capillary was pre-filled with 50 mM Bis-Tris borate 
pM 6.5 and 2% octylglucaside. The injected sample consisted of 10 nM final 
concentration mixture of the 6 tags in lOmM MOPS, 0.05% NP40, 0.05% Tween 20, 7.5 

25 mM MgCh, and 10 ng/uL tRNA, and was hydrodynamically injected into the capillary 
using a vacuum injection of 0.5 psi from the positive electrode side of the capillary. The 
sample was run from the positive electrode capillary end to the negative electrode 
capillary end, for a distance of 1 0 cm to the capillary window. Data is represented as 
stacked traces of the raw CE chromatographs without any calculations or manipulations. 

30 
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1) Effect of Sample Buffer Components on CE Resolution: 

Since sample stacking relies on the conductivity and ionic strength differences 
between the sample buffer and the separation buffer, the effect of INVADER reaction 
buffer components on the efficiency of stacking was initially tested. To do this, 10 nM 
5 concentrations of each of the 6 tags were mixed in buffers containing water (A), 10 mM 
MOPS (B), lOmM MOPS, 0.05% NP40, and 0.05% Tween 20 (C), lOmM MOPS, 0.05% 
NP40, 0.05% Tween 20, and 7.5 mM MgCl 2 (D), lOmM MOPS, 0.05% NP40, 0.05% 
Tween 20, 7,5 mM MgCI 2 , and 10 ng/uL tRNA (E), and lOmM MOPS, 0.05% NP40, 
0.05% Tween 20, 7.5 mM MgCJ 2 , 10 ng/uL tRNA, and 10 ng/uX Afu FEN1 nuclease 

10 (F). Results are shown in Fig. 31. 

It can be seen that the suggested minimal sample buffer components for optimal 
stacking and sensitivity arc the presence of detergents (0.05% NP40 and Tween 20) along 
with lOmM MOPS. Sample in water or 50 mM MOPS did not achieve any detection 
suggesting that the presence of detergent is important to the method. It can also be seen 

1 5 that sample buffer F still allows for good resolution and detection sensitivity. Since the 
INVADER reactions are carried out in sample buffer F, no sample treatment (i.e. 
desalting or concentrating) is required prior to running CRE. 

2) Injection time effects: 

20 Effective sample stacking is highly dependent on the volume injected into the 

capillary (Weinberger, R. Practical capillary electrophoresis, Academic Press, San Diego, 
U.S. A [1993]). In this experiment, the optimal (maximum) injection volume of sample 
was determined. The injected sample volume that gave the best resolution was then used 
in subsequent experiments. 

25 Samples were injected using a 0.5 psi vacuum for periods of 10, 20, 30, 40, and 

60 seconds. Results are shown in Fig. 32 (A, B, C, D, and E, respectively). Results show 
that 10 to 40 seconds injection resulted in an increase in sensitivity. However, 
somewhere between 40 and 60 seconds a loss in resolution is apparent, suggesting that 
stacking is no longer optimal. Therefore a 40 second injection time was used for all 

3 0 subsequent experiments. 
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3) Effect of capillary type: 

The electroendosmotic flow (EOF) of CE is very dependent on the type of 
capillary coating used (Weinberger, {supra)). Commonly used bare-fused silica 
capillaries have an EOF that may cause problems for certain CE applications (Baker, 
5 D.R. Capillary Electrophoresis, Wiley Interscience Publications, New York, USA 

[1995]). Coated capillaries are usually used as a solution to the EOF problem. There are 
two different types of coatings, dynamic and static. Dynamic coating is usually achieved 
by adding a surfactant to the capillary filling buffer. This surfactant interacts with the 
silanol groups of the capillary wall, minimizing the EOF. Static coating, on the other 

10 hand, is achieved by pre-treating the bare-silica capillary with a chemical that reacts with 
the hydroxyls of the silanol groups coating the capillary wall, thus making it neutral and 
eliminating the EOF. In order to determine the best coating material for optimal CRE 
performance several statically coated capillaries were tested. Capillaries tested were: A) 
lOOu, eCAP DNA poly aery I amide coated capillary (Beckman-Coulter); B) 75 ji CEP 

15 coated capillary (Agilent Technologies); C) 75u. uSIL-Wax coated capillary (J&W 
Scientific); D) 75u. 5%T, 5%G pre-filled uPAGE capillary (J&W Scientific); E) 75 \x 
bare fused silica (Beckman-Coulter) (Fig. 33). Results show that capillaries with 
hydrophilic coatings (i.e. polyacrylamide 100 u. eCAP and 75u, CEP) yield the best 
separation and sensitivity. This suggests that with the appropriate coating material 

20 (dynamic or static), bare-silica can be efficiently used to resolve CRE-based INVADER 
assays. 

4) Separation (electrode) and capillary filling buffer effects on CRE 

To determine the ionic strength of the separation buffer that will yield maximum 
25 sample stacking, CRE was performed on INVADER assay tag products using 50 mM 

concentrations, pH 7.2 of: (A) Bis-Tris.borate, (B) Tris-borate, and (C) MOPS. For these 
experiments, the capillary was filled with the same buffer as the separation buffer, with 
the addition of 2% octylglucoside to achieve MECC conditions. Fig. 34 shows the results 
of the different buffers used. Optimal stacking is obtained for the buffer containing 50 
30 mM Bis-Tris borate, pH 7.2. Next, the pH of this buffer was optimized for use in 

subsequent CE experiments. The buffer pHs tested were: 50 mM Bis-Tris borate buffers 
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of (A) pH 6.0, (B) 6.5, and (C) 7.2. Results are shown in Fig. 35. Optimal sample 
stacking and separation of INVADER-generated positive tags are obtained at pH 6.5. 

Finally, to determine the optimal concentration of Bis-Tris. borate buffer to be 
used, concentrations of 25 mM (A), 50 mM (B), and 100 mM (C) - all at pH 6.5 - were 
5 tested (Fig. 36). Results indicate that the optima] concentration of Bis-Tris borate is 50 
mM. The use of non-borate based buffers such as TAE, phosphate, and citrate, for 
example, are also contemplated. 

jj 5) Effect of Detergent on the Efficiency of MECC Separation of 

1 0 INVADER assay-generated Positive Tags 

MECC takes advantage of interactions between the sample to be separated by CE 
and the hydrophilic charged ends of micelles commonly formed by detergent 
(Weinberger, supra). To determine which micelle-forming detergent would give optimal 
results, a number of different detergents were tested. CRT was performed using 
15 capillaries filled with 50 mM Bis-Tris borate, pH 6,5 buffer (A) without any detergent 
additions; (B) with 2% octylglucoside; (C) 2% NP-40; (D) 2% Tween-20; (E) 2% Triton 
XI 00; (F) 2% MEGA-9; (G) 2% Brij 35; and (H) 30 mM Sodium Chelate. 

Results are shown in Fig. 37. It can be seen that optimal MECC resolution is 
obtained in the presence of 2% octylglucoside and that the use of NP-40, Tween-20, 
20 Triton XI 00, and Brij 35 result in lower resolution. The use of MEGA-9 and sodium 
cholatc resulted in no sample detection. It is also worth noting that the presence of no 
, detergent produced a single peak of poor resolution suggesting that sample stacking was 

still successful. 

25 EXAMPLE 17 

Analysis of H-phosphonate modifications by using Capillary Electrophoresis. 

In this example, the products of the INVADER reactions using H-phosphonate 
tags described above (e.g., in Example 10) were analyzed by capillary electrophoresis 
(CE). Compared with gel electrophoresis, capillary electrophoresis offers higher 
30 sensitivity and resolution, faster separation time, automation capabilities and the ability to 
use conditions that cannot be applied to a gel format, such as MECC. 
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Four net positively charged tags 5'-V-(Hex)-Cy3-C-3', 5'-V-(dA)'Cy3-C-3', 5'-V- 
(dG)-Cy3-C-3', and 5'-V-(dT)-Cy3-C-3' were generated by the invasive cleavage of the 
corresponding probes, as described in Example 10 (SEQ ID NOS:61-65, respectively). 
Briefly, 10 pmole of each probe oligo (PI, P2, P4 and P5) were cleaved in the presence of 
5 10 fmole of human MCP1 in vitro transcripts for 3 hours to ensure nearly complete 

conversion of the probes to the cleaved products. The cleaved tags were diluted to 10 nM 
concentration using a solution containing 10 mM MOPS, pH 7,5, 7.5 mM MgCb, 10 
ng/f.iL tRNA (Sigma), 0.05% Tween 20, and 0.05% Nonidet P40 to mimic the buffer 
conditions of INVADER reaction. The samples were separated in 60 cm eCAP DNA 

10 1 00 fim diameter capillary (Beckman) under conditions of micellar electrokinetic 

chromatography (MECC) using a PageMDQ CE instrument (Beckman) equipped with a 
532-nm laser and 580±20 nm emission filter. The capillary-filling buffer contained 50 
mM bis-Tris-borate, pH 6.5 and 2% octylglucoside (Sigma) and the electrode buffers 
contained 50 mM bis-Tris borate, pH 6.5. The samples were injected by applying 0.5 psi 

1 5 vacuum to the outlet end of the capillary for 20 seconds. The tags were separated by 
applying 16 kV electric field, with the positive electrode connected to the inlet buffer. 
The separation distance from the inlet end of the capillary to the detector window was 10 
cm. 

Fig. 38 shows MECC profiles for the four net positively charged tags 5 -V- 
20 (HEX)-Cy3-C-3\ 5'-V-(dA)-Cy3-C-3', 5'-V-(dG)-Cy3-C-3* 9 and 5*-V-(dT)-Cy3-C-3' 
separated individually and as an equimolar mixture of all four molecules. Tag 5*-V- 
(Hex)-Cy3-C-3' produced a single band, whereas each of the tags 5'-V-(dA)-Cy3-C-3\ 5'- 
V-(dG)-Cy3-C-3\ and 5 , -V-(dT)-Cy3-C-3' demonstrated two major peaks. The double- 
peak profiles can be explained by the presence of diastereoisomers formed during the 
25 synthesis of each of the studied tags. The stereoisomers formed by tag 5'-V-(Hex)-Cy3- 
C-3* are not separated under these experimental conditions. The separation of a mixture 
of all four tags shows only four peaks rather than expected seven peaks, suggesting that 
some tags or diastereoisomers have similar mobilities in these conditions. 

It was observed that resolution of eCAP DNA capillaries gradually decreases after 
30 10-20 runs, which could affect the separation of tags mixture shown in Fig. 38. When a 
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fresh capillary was used to analyze the same mixture of the four tags, all seven peaks 
were observed under the same conditions (Fig. 39). 

EXAMPLE 18 

5 Separation of net positively charged tags synthesized using phosphoramiditc 

chemistry. 

Synthesis of charge-balanced oligonucleotides can be performed using a 
phosphoramidite chemistry as described in Examples 4-6. In comparison with H- 

10 phosphonate chemistry used for the tags described in Examples 7 and 8, the 

phosphoramidite chemistry offers the advantage of using commercially available 
synthesizers and avoiding the introduction of centers of chirality at the phosphoramidate 
phosphorus atom during the synthesis. Six oligonucleotides with a general structure 5- 
TagN-GCT CCC GCA GAC AC-3' (SEQ ID NO:83), where TagN denotes one of the six 

15 net positively charged modifications described in Examples 4-6, (shown top to bottom in 
Figure 17, and called Tag 6, Tag 3, Tag 5, Tag 4, Tag 1 and Tag 2, respectively). Each 
probe was cleaved in an invasive cleavage reaction with the INVADER oligonucleotide 
5'-CAA AGA AAA GCT GCG TGA TGA TGA AAT CGC-3' (SEQ ID NO:84, termed 
509-54-3) and the target oligonucleotide 5'-GAA GGT GTC TGC GGG AGC CGA TTT 

20 CAT CAT CAC GCA GCT TTT CTT TGA GG-3' (SEQ ID NO:85, termed 509-54-1) to 
generate net positively charged tags 5'-TagN-G-3\ 

Each INVADER assay reaction was performed with 2 jjiM of one of the six 
probes, 0.1 u.M INVADER oligonucleotide 509-54-3, 10 nM target oligonucleotide 509- 
54- 1 , and 1 00 ng of Ave FEN 1 CLEAVASE enzyme (at 1 Ong/u.1) in a 10 u,L solution 
' 25 containing 10 mM MOPS, pH 7.5, 7.5 mM MgCl 2 . The reactions were incubated at 63°C 
for 3 hours. Under these conditions, nearly all the probe molecules were cleaved 
generating approximately 2pM of each positively charged tag. The cleaved products 
were diluted to 10 nM concentration in a solution containing 10 mM MOPS, pH 7.5, 7.5 
mM MgCl 2 , 10 ng/uL tRNA (Sigma), 0.05% Tween 20, and 0.05% Nonidel P40 and 

30 analyzed by MECC as described in Examples 1 6 and 17. 
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Fig. 40 shows MECC profiles for each of the six net positively charged tags 
separated individually or as an equimolar mixture of all six molecules. Each of the tags 
produced a single peak, confirming the absence of chirality centers from the 
modifications. The MECC separation of the mixture of all six tags shows six peaks, 
5 indicating that the CE conditions described here are able to detect the differences in 
chemical structure of all six tags bearing net positively charged modifications. 
Separation demonstrating the power of the MECC assay is emphasized by the fact that 
modifications in two pairs of tags, Tagl/Tag2 and Tag4/Tag5, are composed of identical 
chemical building blocks differing only in the order of attachment, and therefore have an 

10 identical chemical composition. Nonetheless, they were easily resolved, demonstrating 
that the order of addition can be used as an additional variable, further extending the 
library of tags that can be configured from a collection of simple building blocks. 

Superior resolution of MECC assay compared with gel electrophoresis is 
demonstrated in Fig. 41. Samples containing 0.2 pmol of 5'-Tagl-G-3' or 5 , -Tag2-G-3* in 

15 2 uL of 95% formamide, 20 mM EDTA and 0.02% methyl violet were loaded on a 

100x100x2 mm slab of 20% denaturing polyacrylamide gel (crosslinked 19:1) with 7 M 
urea in a buffer containing 45 mM Tris-borate, pH 8.3 and 1 mM EDTA Fig, 41 A) or on 
a 100x100x2 mm slab of 10% native polyacrylamide gel (crosslinked 19:1) in a buffer 
containing 50 mM bis-Tris-borate, pH 6.5 (Fig. 4 IB). The samples were separated by 

20 applying an electric field of 5 watts power for 30 minutes with the positive electrode 
connected to the top buffer reservoir (reverse orientation). The tags were visualized 
using FMBICMOO fluorescence imager as described in Example 9. Fig. 41 A shows that 
S'-Tagl-G^' or 5'-Tag2-G-3 1 have very low mobility under the conditions of the 
denaturing gel, precluding their identification based on this characteristic. Under the 

25 native conditions (Fig. 4 IB), each of the net positively charged tags was separated as two 
bands. There was no significant difference in the electrophoretic mobility between the 
two tags to distinguish them from each other. 



All publications and patents mentioned in the above specification are herein 
30 incorporated by reference. Various modifications and variations of the described 

methods and systems of the invention will be apparent to those skilled in the art without 
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departing from the scope and spirit of the invention. Although the invention has been 
described in connection with specific preferred embodiments, it should be understood 
that the invention as claimed should not be unduly limited to such specific embodiments. 
Indeed, various modifications of the described modes for carrying out the invention 
which are obvious to those skilled in the relevant fields are intended to be within the 
scope of the following claims. 
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CLAIMS 
We claim: 

5 1 . A composition comprising a charge tag attached to a terminal end of a 

nucleic acid molecule, said charge tag comprising a phosphate group and a positively 
charged moiety. 

2. The composition of Claim 1, wherein said charge tag further comprises a 

10 dye. 

3. The composition of Claim 2, wherein said dye is positioned between said 
nucleic acid and said positively charged moiety. 



1 5 4. The composition of Claim 2, wherein said positively charged moiety is 

positioned between said nucleic acid and said dye. 

5. The composition of Claim 1 , wherein said charge tag further comprises a 
second positively charged moiety. 

20 

6. The composition of Claim 1, wherein said charge tag has a net positive 
charge of 1 at pH 6-10. 

7. The composition of Claim 1, wherein said charge tag has a net positive 
2S charge of 2 at pH 6-10. 

8. The composition of Claim 1 , wherein said charge tag further comprises 
one or more nucleotides. 
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9. The composition of Claim 8, wherein said nucleic acid molecule 
comprises a sequence complementary to a target nucleic acid, wherein said one or more 
nucleotides of said charge tag are not complementary to said target nucleic acid. 

5 10. The composition of Claim 1 , wherein said nucleic acid comprises a first 

portion complementary to a target nucleic acid and a second portion that is hot 
complementary to said target nucleic acid, wherein said second portion comprises said 
terminal end. 

10 11. The composition of Claim 1, wherein said nucleic acid and said charge tag 

have a combined net neutral charge and wherein said charge tag has a net positive charge. 

12. The composition of Claim 1, wherein said nucleic acid and said charge tag 
have a combined net negative charge and wherein said charge tag has a net positive 

15 charge. 

13. The composition of Claim 1, wherein said charge tag contains a primary 

amine. 

20 14. The composition of Claim 1, wherein said charge tag contains a secondary 

amine. 

15. The composition of Claim 1, wherein said charge tag contains a tertiary 

amine. 

25 

16. The composition of Claim 1, wherein said charge tag contains an 
ammonium group. 

1 7. The composition of Claim 1 , wherein said charge tag further comprises a 
30 positively charged phosphoramidite. 
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1 8. The composition of Claim 1 , wherein said charge tag further comprises a 
neutral phosphoramidite. 

1 9. A composition comprising a positively charged phosphoramidite. 



20. The composition of Claim 19, wherein said phosphoramidite comprises a 
primary amine group. 

21. The composition of Claim 19, wherein said phosphoramidite comprises a 
1 0 secondary amine group. 

22. The composition of Claim 19, wherein said phosphoramidite comprises a 
tertiary amine group. 

15 23. The composition of Claim 19, wherein said phosphoramidite comprises an 

ammonium group. 

24. The composition of Claim 19, wherein said phosphoramidite has a net 
positive charge of one. 



5 



20 



25. The composition of Claim 19, wherein said phosphoramidite has the 



structure: 




25 



wherein, X is a reactive phosphate group and Y is a protected hydroxy 1 group. 



26. A composition comprising a nucleic acid molecule, said nucleic acid 
molecule comprising a positively charged phosphoramidite. 



30 
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27. A composition comprising a charge tag attached to a terminal end of a 
nucleic acid molecule, said charge tag comprising a positively charged phosphoramidite. 

28. The composition of Claim 27, wherein said positively charged 

5 phosphoramite comprise an amine group, wherein said amine group is not further 
attached to another molecule. 

29. A composition comprising a neutrally charged phosphoramidite, wherein 
said neutrally charged phosphoramidite comprises a nitrogen-containing chemical group 

10 selected from the group consisting of secondary amine, tertiary amine, and ammonium 



30. The composition of Claim 29, wherein said composition further comprises 
a nucleic acid molecule attached to said neutrally charged phosphoramidite. 



3 1 . The composition of Claim 30, wherein said nucleic acid molecule is 
attached to a charge tag comprising said neutrally charged phosphoramidite. 

32. The composition of Claim 30, wherein said charge tag further comprises a 
20 positively charged phosphoramidite. 

33. The composition of Claims 32, wherein said charge tag has a net positive 

charge. 

. 25 34. The composition of Claim 29, wherein said phosphoramidite has the 



30 wherein X is a protected hydroxyl group, Z is a reactive phosphate, and N comprises an 
amine group. 



groups. 



15 



structure: 
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35. The composition of Claim 34, wherein N is N-(CH2) n CH3, wherein n is 0 
or a positive integer from 1 to 12. 

5 36. A composition comprising a solid support attached to a charge tag, said 

charge tag comprising a positively charged moiety and a reactive group configured to 
allow said charge tag to covalently attach to a nucleic acid molecule. 

37. A composition comprising a fluorescent dye directly bonded to a 

10 phosphate group, wherein said phosphate group is directly bonded to an amine group. 

38. The composition of Claim 37, wherein said composition comprises a 
charge tag, wherein said fluorescent dye is contained within said charge tag. 

15 39. The composition of Claim 37, wherein said fluorescent dye comprises 

Cy3. 

40. A mixture comprising a plurality of oligonucleotides, each oligonucleotide 
attached to a different charge tag, each of said charge tags comprising a phosphate group 

20 and a positively charged moiety. 

4 1 . The mixture of Claim 40, wherein said plurality of oligonucleotides 
comprises four or more oligonucleotides, each attached to a different charge tag. 

. 25 42. The mixture of Claim 40, wherein said plurality of oligonucleotides 

comprises ten or more oligonucleotides, each attached to a different charge tag. 

43. The mixture of Claim 40, wherein said plurality of oligonucleotides 
comprises twenty or more oligonucleotides, each attached to a different charge tag. 

30 
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44. The mixture of Claim 40, wherein said plurality of oligonucleotides 
comprises fifty or more oligonucleotides, each attached to a different charge tag. 

45. The mixture of Claim 40, wherein said charge tags comprise a dye. 

5 

46. The mixture of Claim 40, wherein said charge tags comprise a second 
positively charged moiety. 

47. The mixture of Claim 40, wherein said charge tags further comprise one or 
10 more nucleotides. 

48. The mixture of Claim 40, wherein each of said oligonucleotides attached 
to a charge tag have a combined net neutral charge and wherein said charge tag has a net 
positive charge. 

15 

48. The mixture of Claim 40, wherein each of said oligonucleotides attached 
to a charge tag have a combined net negative charge and wherein said charge tag has a 
net positive charge. 

20 49. The mixture of Claim 40, wherein said charge tags comprise a positively 

charged phosphoramidite. 

50. The mixture of Claim 40, wherein said charge tags comprise a neutral 
phosphoramidite. 

• 25 

51 . A method of separating nucleic acid molecules, comprising the steps of: 
a) treating a charge-balanced oligonucleotide containing a charge 

tag under conditions such that a charge-unbalanced 
oligonucleotide containing said charge tag is produced, wherein 
30 said charge-unbalanced oligonucleotide is contained in a 

reaction mixture; and 
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separating said charge-unbalanced oligonucleotide from said 
reaction mixture. 



52. The method of Claim 51, wherein said conditions comprise treating said 
5 charge-balanced oligonucleotide with a reactant. 

53. The method of Claim 51, wherein said charge tag is attached to a terminal 
end of said oligonucleotide, said charge tag comprising a phosphate group and a 
positively charged moiety. 

10 

54. The method of Claim 51, wherein said charge tag comprises a dye. 

55. The method of Claim 54, wherein said dye is positioned between said 
oligonucleotide and said positively charged moiety. 

15 

56. The method of Claim 54, wherein said positively charged moiety is 
positioned between said oligonucleotide and said dye. 

57. The method of Claim 53, wherein said charge tag further comprises a 
20 second positively charged moiety, 

58. The method of Claim 51, wherein said charge tag comprises one or more 
nucleotides. 

25 59. The method of Claim 58, wherein said oligonucleotide comprises a 

sequence complementary to a target nucleic acid, wherein said one or more nucleotides of 
said charge tag are not complementary to said target nucleic acid. 

60, The method of Claim 53, wherein said oligonucleotide comprises a first 
30 portion complementary to a target nucleic acid and a second portion that is not 
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complementary to said target nucleic acid, wherein said second portion comprises said 
terminal end. 



61. The method of Claim 51 , wherein said charge-balanced oligonucleotide 
5 has a net neutral charge and wherein said charge-unbalanced oligonucleotide has a net 

positive charge. 

62. The method of Claim* 51 , wherein said charge-balanced oligonucleotide 
has a net negative charge and wherein said charge-unbalanced oligonucleotide has a net 

10 positive charge. 

63. The method of Claim 51, wherein said charge tag contains a primary 

amine. 

1 5 64, The method of Claim 51 , wherein said charge tag contains a secondary 

amine. 

65. The method of Claim 51, wherein said charge tag contains a tertiary 

amine. 

20 

66. The method of Claim 51 , wherein said charge tag contains an ammonium 

group. 

67. The method of Claim 51, wherein said charge tag comprises a positively 
25 charged phosphoramidite. 

68. The method of Claim 51, wherein said charge tag comprises a neutral 
phosphoramidite. 

30 69, The method of Claim 5 i , wherein said separating comprises capillary 

electrophoretic separation. 
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70. The method of Claim 51, wherein said separating comprises capillary zone 
electrophoretic separation. 

5 71. The method of Claim 5 1 , wherein said separating occurs in a 

microchannel. 



72. A method of separating nucleic acid molecules, comprising the steps of: 

a) treating a plurality of charge-balanced oligonucleotides, each 

1 0 containing different charge tags, under conditions such that two or 

more charge-unbalanced oligonucleotides containing said charge tags 
are produced, wherein said charge-unbalanced oligonucleotides are 
contained in a reaction mixture; and 

b) separating said charge-unbalanced oligonucleotides from said reaction 
15 mixture. 



73. The method of Claim 72, wherein said separating comprises separating 
said charge-unbalanced oligonucleotides such that charge-unbalanced oligonucleotides 
containing different charge lags are separated from one another. 

20 

74. The method of Claim 72, wherein said plurality of charge- balanced 
oligonucleotides comprise four or more charge-balanced oligonucleotides comprising 
different charge tags. 



25 75. The method of Claim 72, wherein said plurality of charge-balanced 

oligonucleotides comprise ten or more charge-balanced oligonucleotides comprising 
different charge tags. 



76. The method of Claim 72, wherein said plurality of charge- balanced 
30 oligonucleotides comprise twenty or more charge-balanced oligonucleotides comprising 
different charge tags. 
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77. The method of Claim 72, wherein said plurality of charge-balanced 
oligonucleotides comprise fifty or more charge-balanced oligonucleotides comprising 
different charge tags. 

78. The method of Claim 72, wherein said conditions comprise treating said 
charge-balanced oligonucleotide with a reactant 



79. The method of Claim 72, wherein said charge tags are attached to terminal 
1 0 ends of said plurality of oligonucleotides, said charge tags comprising a phosphate group 
and a positively charged moiety. 



80. The method of Claim 72, wherein said charge tags comprise a dye. 

15 81. The method of Claim 72, wherein said charge tags comprise a positively 

charged phosphoramidite. 

82. The method of Claim 72, wherein said charge tags comprise a neutral 
phosphoramidite. 

20 

83. The method of Claim 72, wherein said separating comprises capillary 
electrophoretic separation. 



84. The method of Claim 72, wherein said separating comprises capillary zone 
25 electrophoretic separation. 

85. The method of Claim 72, wherein said separating occurs in a 
microchannel. 



30 
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FIGURE 3 
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FIGURE 4a 
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FIGURE 4b 
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FIGURE 6 
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FIGURE 9 
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FIGURE 11 
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FIGURE 12 
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FIGURE 14 
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i FIGURE 15 

4 



(0) (+1) 

A. 3'-DNA - (DP) - (PCP) 



'i B. 3'-DNA - (DP) - (NP) n - (PCP) 



(0) (-1) (+1) 
C. 3--DNA - (DP) - (PBB) n - (PCP) (n+1) 



(0) (-1) (+1) (0) 
3'-DNA - (DP) - (PBB) n - (PCP) (n+1) - (NP) 
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j FIGURE 16 
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Neutral Phosphoramidite 
Net Charge : 0 



, — v i k OCE 



Positively Charged Phosphoramidite 
Net Charge : +1 
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FIGURE 17 
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FIGURE 18 
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FIGURE 19 



p{ . s . / Cleavage Site 
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^CTTTTC ACC AGCG AGACGGG - 3 ' 
M13 3'-CGCCAAACGCATAAC.CCGCGGTCCCACCAAAAAGAAAAGTGGTCGCTCTGCCC-5 

TATTGGGCGCCAGGGTGGTTTTT 
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FIGURE 21 
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FIGURE 22 
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FIGURE 23 

A. 

^3S^x:ttcggagtttggg - NH, - 3 ' 

5 ' - GGGTTGTGGAGTGAGTGTTC AAGTA - 3 ' 

3 ' -CCCAACACCUCACUCACAAGUUCAGAAGCCUCAAACCCAAACGAA-5 ' 
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FIGURE 28A 



Primary reaction 

\ } 

XTTC 

5 ' -GGGTTGTGGAGTGAGTGTTCAAGTA-3 ' 
3 ' -CCCAACACCUCACUCACAAGUUCAGAAGCCUCAAACCCAA.^.CGAA-5 ' 



OCTTCGGAGTTTGGG - NH r - 3 ' 



Human MCP-1 mRNA 

5 ' -CCGTCACGCCTCCTTCGGAGTTTGGG-NH 2 -3 « 
3' -NH^.GTGCGGAGGAAGCCTCAAACCCAA - 5 ' 



Secondary reaction 

5'-V(dC)+ I 

^^^x:tcattctcagtgcgo - 
5' -ccgtcacgcctcc 
3 • -nh 2 -uggcagtgcggaggagtaagagtgacgcg- 5 ' 
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FIGURE 28B 



Primary reaction 

5 ' - CCTTCCTTATCCTGGATCTTGGCA- 3 ' 

3 ' -GGAAGGAATAGGACCTAGAACCGGAAATGTAAAAGATAGCATAGGC- 5 * 



Human Ubiquitin mRNA 



5 ' -AACGAGGCGCACX!TTTACATTTTCTATCGT~NH 2 - 3 ' 
3 ' ~ NH? - CGCGTGGAAATGTAAAAGATAGCA - 5 ' 



Secondary reaction 

V(dC 



^ACTTGCTTCCTCC- 3 
5 ' AACGAGGCGCACC 
- NH 2 - AAUUGCTCCGCGTGGTGAACGAAGAAGGC - 
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FIGURE 30A 
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FIGURE 30B 
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FIGURE 38 
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FIGURE 39 
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FIGURE 41 
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SEQUENCE LISTING 

<110> Lyamichev, Victor 

<120> CHARGE TAGS AND SEPARATION OF NUCLEIC ACID MOLECULES 
<130> FORS 06873 
<160> 85 

<rl70> Patentin version 3.0 
<210> 1 



tf j <211> 23 



<212> DNA 

<213> synthetic 
<220> 

<221> misc_feature 

<222> (1)..(2) 

<223> misc. feature 

<400> 1 

tntcttttca ccagcgagac ggg 23 

<210> 2 

<211> 22 

<212> DNA 

<213> synthetic 

<400> 2 

attgggcgcc agggtggttt tt 22 

<210> 3 

<211> 30 

<212> DNA 

<213> synthetic 

<400> 3 

cagggtgaag ggaagaagaa agcgaaaggt 3° 

<210> 4 

<211> 30 

<212> DNA 
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<213> synthetic 

<400> 4 

cagggggaag ggaagaagaa agcgaaaggt 3 0 

<210> 5 
<211> 28 
<212> DNA 
<213> synthetic 
<400> 5 

cacgaat'tcc gaggcgatgc ttccgctc 2 8 

<210> 6 
<211> 30 
<212> DNA 
<213> synthetic 
<400> 6 

tcgacgtcga ctaacccttg gcggaaagcc 3 0 

<210> 7 
<211> 23 
<212> DNA 
<213> synthetic 
<400> 7 

gcatcgcctc ggaattcatg gtc 2 3 

<210> 8 
<211> 836 
<212> PRT 

<213> Thermus thermophilus 
<400> 8 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 



2/47 



Printed from Mimosa 03/05/15 14:59:32 Page: 152 



WO 02/063030 



PCT/US02/03423 



Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
SC St 6C 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 ~ 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

* 

-Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Asp Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 ~ 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
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3 85 3 90 -3 95 4 0 C 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 ' 585 590 

Pro Leu Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 
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Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
74C 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val His Asp Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly 
835 

<210> 9 
<211> 2511 
<212> DNA 

<213> Thermus thermophilus 
<400> 9 



atgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


9 a 999 cc ccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacotccc 


cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 
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atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcagggacg 


gccgggtgca 


cc 9S9cagco 


102C 


gaccccttgg 


cggggctaaa 


ggacct caag 


gaggtccggg 


gcctcct cgc 


caagcacctc 


1 OBC 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccctc 


caacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


X260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


catggaggcc 


accggggtac 


ggcgggacgt 


ggcctacctt 


1380 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 


ttqqcqggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 


gagcttaggc 


ttcccgcctt 


ggggaagacg 


caaaagacag gcaagcgctc 


caccagcgcc 


1560 


qccjqtqctqq 


aggccctacg 


ggaggcccac 

*J ZJ ZJ 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 


acqqqccqcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag 


gcttagtagc 


1740 


tccgacccca 


acctgcagaa 


catccccgtc 


cgcaccccct 


tgggccagag 


gatccgccgg 


1800 


qccttcqtgg 


ccgaggcggg 


ttgggcgttg 


gtggccctgg 


actatagcca 


gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 


cggggacgaa 

-m* ZJ Zj ZJ 


aacctgatca 


gggtcttcca ggaggggaag 


1920 


gacatccaca 


cccagaccgc 


aagctggatg 


ttcggcgtcc 


ccccggaggc 


cgtggacccc 


1980 


ctgatgcgcc 


qggcggccaa 


qacggtgaac 


ttcggcgtcc 


tctacggcat 


gtccgcccat 


2040 


aggctctccc 


aqqaqcttqc 


catc.ccctac 


gaggaggcgg 


tggcctttat 


agagcgctac 


2100 


ttccaaagct 


tccccaaggt 


gcgggcctgg 


atagaaaaga 


ccctggagga 


ggggaggaag 


2160 


cggggctacg 


tggaaaccct 


cttcggaaga 


aggcgctacg 


tgcccgacct 


caacgcccgg 


2220 


gtgaagagcg 


tcagggaggc 


cgcggagcgc 


atggccttca 


acatgcccgt 


ccagggcacc 


2280 


gccgccgacc 


tcatgaagct 


cgccatggtg 


aagctcttcc 


cccgcctccg 


ggagatgggg 


2340 


gcccgcatgc 


tcctccaggt 


ccacgacgag 


ctcctcctgg 


aggcccccca 


agcgcgggcc 


2400 


gaggaggtgg 


cggctttggc 


caaggaggcc 


atggagaagg 


cctatcccct 


cgccgtgccc 


2460 


ctggaggtgg 


aggtggggat 


gggggaggac 


tggctttccg 


ccaagggtta 


9 


2511 



<210> 10 
<211> 26 
<212> DNA 

<213> synthetic 
<400> 10 

caggaggagc tcgttgtgga cctgga 2 6 
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<210> 




























<211> 


836 


























<212> 


PRT 


























<213> 


Thermus ' 


thermophilus 


















<400> 


11 


























Met 
1 


Asn 


Ser 


Glu 


Ala 
5 


Met 


Leu 


Pro 


Leu Phe 
10 


Glu 


Pro 


Lys 


Gly 


Arg 
15 


..V 


Leu 


Leu 


Val 


Asp 
20 


Gly 


His 


His 


Leu 


Ala Tyr 
25 


Arg 


Thr 


Phe 


Phe 
30 


Ala 




Lys 


Gly 


Leu 

35 


Thr 


Thr 


Ser 


Arg 


Gly 
40 


Glu Pro 


Val 


Gin 


Ala 
45 


Val 


Tyr 




Phe 


Ala 
50 


Lys 


Ser 


Leu 


Leu 


Lys 
55 


Ala 


Leu Lys 


Glu 


Asp 
60 


Gly 


Tyr 


Lys 




Val 
65 


Phe 


Val 


Val 


Phe 


Asp 
70 


Ala 


Lys 


Ala Pro 


Ser 
75 


Phe 


Arg 


His 


Glu 




Tyr 


Glu 


Ala 


Tyr 


Lys 
85 


Ala 


Gly 


Arg 


Ala Pro 
90 


Thr 


Pro 


Glu 


Asp 


Phe 

95 




Arg 


Gin 


Leu 


Ala 
100 


Leu 


He 


Lys 


Glu 


Leu Val 
105 


Asp 


Leu 


Leu 


Gly 
110 


Phe 


■/i 
% 


Arg 


Leu 


Glu 
115 


Val 


Pro 


Gly 


Tyr 


Glu 
120 


Ala Asp 


Asp 


Val 


Leu 
125 


Ala 


Thr 




Ala 


Lys 
130 


Lys 


Ala 


Glu 


Lys 


Glu 
135 


Gly 


Tyr Glu 


Val 


Arg 
140 


He 


Leu 


Thr 




Asp 

T A C 


Arg 


Asp 


Leu 


Tyr 


Gin 
150 


Leu 


Val 


Ser Asp 


Arg 
155 


Val 


Ala 


Val 


Leu 




Pro 


Glu 


V3 jl y 


His 


165 


JL JlC 




Pro 


Glu Trp 
170 


Leu 


Trp 


Glu 


Lys 


Tyr 
175 




Leu 


Arg 


Pro 


Glu 
180 


Gin 


Trp 


Val 


Asp 


Phe Arg 
185 


Ala 


Leu 


Val 


Gly 
190 


Asp 




Ser 


Asp 


Asn 
195 


Leu 


Pro 


Gly 


Val 


Lys 

200 


Gly He 


Gly 


Glu 


Lys 
205 


Thr 


Ala 




Lys 


Leu 
210 


Leu 


Lys 


Glu 


Trp 


Gly 
215 


Ser 


Leu Glu 


Asn 'Leu 
220 


Leu 


Lys 


Asn 




Asp 
225 


Arg 


Val 


Lys 


Pro 


Glu 
230 


Asn 


Val 


Arg Glu 


Lys 
235 


He 


Lys 


Ala 


His 




Glu 


Asp 


Leu 


Arg 


Leu 
245 


Ser 


Leu 


Glu 


Leu Ser 
250 


Arg 


val 


Arg 


Thr 


Asp 
255 




Pro 


Leu 


Glu 


Val 
260 


Asp 


Leu 


Ala 


Gin 


Gly Arg 
265 


Glu 


Pro 


Asp 


Arg 
270 


Glu 




Leu 


Arg 


Ala 
275 


Phe 


Leu 


Glu 


Arg 


Leu 
280 


Glu Phe 


Gly 


Ser 


Leu 
285 


Leu 


His 



80 



160 



240 
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Phe Gly Leu Leu Glu 
2 90 

Fro Pro Glu Gly Ala 
305 

Met Trp Ala Glu Leu 
325 

His Arg Ala Ala Asp 
340 

Arg Gly Leu Leu Ala 
355 

Leu Asp Leu Val Pro 
370 

Asp Pro Ser Asn Thr 
38S 

Glu Trp Thr Glu Asp 
405 

His Arg Asn Leu Leu 
420 

Leu Tyr His Glu Val 
435 

Glu Ala Thr Gly Val 
450 

Leu Glu Leu Ala Glu 
465 

Leu Ala Gly His Pro 
485 

Val Leu Phe Asp Glu 
500 

Thr Gly Lys Arg Ser 
515 

Ala His Pro lie Val 
530 

Leu Lys Asn Thr Tyr 
545 

* Thr Gly Arg Leu His 
565 

Arg Leu Ser Ser Ser 
580 

pro Leu Gly Gin Arg 
595 

Ala Leu Val Ala Leu 
610 

His Leu ser Gly Asp 



Ala Pre Ala Pre Leu Glu 

29^ 

Phe Val Gly Phe Val Leu 

310 315 

Lys Ala Leu Ala Ala Cys 
330 

Pro Leu Ala Gly Leu Lys 
345 

Lys Asp Leu Ala Val Leu 
360 

Gly Asp Asp Pro Met Leu 
375 

Thr Pro Glu Gly Val Ala 
390 395 

Ala Ala His Arg Ala Leu 
410 

Lys Arg Leu Glu Gly Glu 
425 

Glu Lys Pro Leu Ser Arg 
440 

Arg Arg Asp Val Ala Tyr 
455 

Glu lie Arg Arg Leu Glu 
470 475 

Phe Asn Leu Asn Ser Arg 
490 

Leu Arg Leu Pro Ala Leu 
505 

Thr Ser Ala Ala Val Leu 
520 

Glu Lys lie Leu Gin His 
535 

Val Asp Pro Leu Pro Ser 
550 555 

Thr Arg Phe Asn Gin Thr 
570 

Asp Pro Asn Leu Gin Asn 
585 

lie Arg Arg Ala Phe Val 
600 

Asp Tyr Ser Gin He Glu 
615 

Glu Asn Leu lie Arg Val 



Glu Ala Pre Trp Pre 
300 

Ser Arg Pro Glu Pre 
32 0 

Arg Asp Gly Arg Val 
335 

Asp Leu Lys Glu Val 
350 

Ala Ser Arg Glu Gly 
365 

Leu Ala Tyr Leu Leu 
380 

Arg Arg Tyr Gly Gly 
400 

Leu Ser Glu Arg Leu 
415 

Glu Lys Leu Leu Trp 
430 

Val Leu Ala His Met 
445 

Leu Gin Ala Leu Ser 
460 

Glu Glu Val Phe Arg 
480 

Asp Gin Leu Glu Arg 

' 4 95 

Gly Lys Thr Gin Lys 
510 

Glu Ala Leu Arg Glu 
52S 

Arg Glu Leu Thr Lys 
540 

Leu Val His Pro Arg 
560 

Ala Thr Ala Thr Gly 
575 

He Pro Val Arg Thr 
590 

Ala Glu Ala Gly Trp 
605 

Leu Arg Val Leu Ala 
620 

Phe Gin Glu Gly Lys 



8/47 



Printed from Mimosa 03/05/15 14:59:49 Page: 158 



WO 02/063030 



PCT/US02/03423 



625. 63 C 63 5 6 4C 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val His Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly 
835 

<210> 12 
<211> 2511 
<212> DNA 

<213> Thermus thermophilus 
<400> 12 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 
ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120 
gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 18 0 
gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 240 
tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 3 00 
ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 360 
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gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 42c 

atccicaccg ccgaccgcga cccctaccaa ctcgtctccg accgcgtcgc cgtcctccac 480 

cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 54 0 

cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 600 

ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 66 0 

ctcaagaacc cggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 72 0 

gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 78 0 

gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 840 

gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 900 

gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 96 o 

atgtgggcgg agcttaaagc cctggccgcc tgcagggacg gccgggtgca ccgggcagca 1020 

gacccctcgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 108 0 

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 1140 

gcctacctcc tggacccctc caacaccacc cccgaggggg tggcgcggcg ctacgggggg 12 00 

gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 126 0 

cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 1320 

ctctcccggg tcctggccca catggaggcc accggggtac ggcgggacgt ggcctacctt 1380 

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440 

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500 

gagcttaggc ttcccgcctt ggggaagacg caaaagacag gcaagcgctc caccagcgcc 1560 

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740 

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2 220 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 228 0 



10/47 



Printed from Mimosa 03/05/15 14:59:53 Page: 160 



WO 02/063030 PCT/US02/03423 



gccgccgacc 


t catgaagct 


cgccatggtg 


aagctcttcc 


cccgcctccg 


ggagatgggg 


2340 


gcccgcatgc 


tcctccaggt 


ccacaacgag 


ctcctcctgg 


aggcccccca 


agcgcgggcc 


2400 


gaggaggtgg 


cggctttggc 


caaggaggcc 


atggagaagg 


cctatcccct 


cgccgtgccc 


2460 


ctggaggtgg aggtggggat 


gggggaggac 


tggctttccg 


ccaagggtta 


g 


2511 


<210> 13 














<211> 58 














<212> DNA 














<213> synthetic 












<400> 13 














tgcctgcagg 


tcgacgctag 


ctagtggtgg 


tggtggtggt 


gacccttggc 


ggaaagcc 


58 


<210> 14 














<211> 2526 












<212> DNA 














<213> Thermus thermophilus 










<400> 14 














atgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa* 


aggaggggta 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


a ggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 
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atgtgggcgg 


agct: taaagc 


cct 9Qccgcc 


tgcagggacg 


geegggtoca 


ccgggcagca 


1C2C 


gacccct egg 


eggggctaaa 


ggacctcaag 


gaggt ccggg 


gcctcctcgc 


caaggacctc 


106C 


geegtcttgg 


cctcgaggga 


9gggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1X40 


gcctacctcc 


tggacccctc 


caacaccacc 


cccgaggggg 


tggcgcggcg 


etaegggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct 


accacgaggt 


ggaaaagece 


1320 


ctctcccggg 


tcctggccca 


catggaggee 


aceggggtae 


ggcgggacgt 


ggcctaccfct 


1380 


caggcccttt 


ccctggagct 


tgeggaggag 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 


gagcttaggc 


ttcccgcctt 


ggggaagacg 


caaaagacag 


gcaagcgctc 


caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 


acgggccgcc 


tccacacccg cttcaaccag acggccacgg 


ecaeggggag gcttagtagc 


1740 


tccgacccca 


acctgeagaa 


catccccgtc 


cgcaccccct 


tgggecagag gatccgccgg 


1800 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg gtggccctgg 


actatageca 


gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 


eggggacgaa 


aacctgatca 


gggtcttcca 


ggaggggaag 


1920 


gacatccaca 


cccagaccgc aagctggatg 


ttcggcgtcc 


ccccggaggc 


cgtggacccc 


1980 


ctgatgcgcc 


gggcggccaa gaeggtgaac 


ttcggcgtcc 


tetaeggcat 


gtccgcccat 


2040 


aggctctccc 


aggagcttgc 


catcccctac 


gaggaggegg 


tggectttat 


agagegctae 


2100 


ttccaaagct 


tccccaaggt 


gcgggcctgg 


atagaaaaga 


ccctggagga 


ggggaggaag 


2160 


eggggctacg 


tggaaaccct 


etteggaaga 


aggegctacg 


tgcccgacct 


caacgcccgg 


2220 


gtgaagagcg 


tcagggaggc 


cgcggagcgc 


atggccttca 


acatgcccgt 


ccagggcacc 


2280 


gccgccgacc 


tcatgaagct 


cgccatggtg 


aagctcttcc 


cccgcctccg 


ggagatgggg 


2340 


gcccgcatgc 


tcctccaggt 


ccacaacgag 


ctcctcctgg 


aggcccccca 


agcgcgggcc 


2400 


gaggaggtgg 


crggctttggc 


caaggaggee 


atggagaagg 


cctatcccct 


cgccgtgccc 


2460 


. ctggaggtgg aggtggggat gggggaggac 


tggctttccg 


ccaagggtca 


ccaccaccac 


2520 


caccac 












2526 



<210> 15 

<211> 842 

<212> PRT 

<213> Thermus thermophilus 

<400> 15 
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Met Asn Ser Glu Ala Met Leu Pre Leu Phe Glu Pre Lys Gly Arg Va: 
: £ 10 1 1 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Leu 
195 200 * 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

" Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 3X5 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Asp Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 



13/47 



Printed from Mimosa 03/05/15 14:59:59 Page: 163 



WO 02/063030 



PCT/US02/03423 



34C 34£ 35C 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 36b 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 " " 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 

435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 t 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 
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Pro Tyr Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe 
690 695 70C 

Pro Lys Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val His Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 

<210> 16 
<211> 31 
<212> DNA 
<213> synthetic 
<400> 16 

gcctgcaggg gcggccgcgt gcaccggggc a 31 

<210> 17 
<211> 26 
<212> DNA 
<213> synthetic 
<400> 17 

ctcctggacc cttcgaacac cacccc 26 

<210> 18 

<211> 23 

<212> DNA 

<213> synthetic 
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<400> 1G 

gtcctggccc atatggaggc cac 

<210? 19 

<211> 2526 

<212> DNA 

<213> Thermus thermophilics 

<400> 19 



atgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


g^gtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


460 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


t-gcaggggcg gccgcgtgca 


ccgggcagca 


1020 


gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg gggacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac 


ggcgggacgt 


ggcctacctt 


1380 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 
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gagcttaggc 


ttcccgcctt 


ggggaagacg 


caaaagacag 


gcaagcgctc 


caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag 


gcttagtagc 


1740 


tccgacccca 


acctgcagaa 


catccccgtc 


cgcaccccct 


tgggccagag 


gatccgccgg 


1800 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg 


gtggccctgg 


actatagcca 


gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 


cggggacgaa 


aacctgatca 


gggtcttcca 


ggaggggaag 


1920 


gacatccaca 


cccagaccgc 


aagctggatg 


ttcggcgtcc 


ccccggaggc 


cgtggacccc 


1980 


ctgatgcgcc 


gggcggccaa 


gacggtgaac 


ttcggcgtcc 


tctacggcat 


gtccgcccat 


2040 


aggctctccc 


aggagcttgc 


catcccctac 


gaggaggcgg 


tggcctttat 


agagcgctac 


2100 


ttccaaagct 


tccccaaggt 


gcgggcctgg 


atagaaaaga 


ccctggagga 


ggggaggaag 


2160 


cggggctacg 


tggaaaccct 


cttcggaaga 


aggcgctacg 


tgcccgacct 


caacgcccgg 


2220 


gtgaagagcg 


tcagggaggc 


cgcggagcgc 


atggccttca 


acatgcccgt 


ccagggcacc 


2280 


gccgccgacc 


tcatgaagct 


cgccatggtg 


aagctcttcc 


cccgcctccg 


ggagatgggg 


2340 


gcccgcatgc 


tcctccaggt 


ccacaacgag 


ctcctcctgg 


aggcccccca 


agcgcgggcc 


2400 


gaggaggtgg 


cggctttggc 


caaggaggcc 


atggagaagg 


cctatcccct 


cgccgtgccc 


2460 


ctggaggtgg 


aggtggggat 


gggggaggac 


tggctttccg 


ccaagggtca 


ccaccaccac 


2520 


caccac 












2526 



<210> 20 
<211> 842 
<212> PRT 

<213> Thermus thermophilics 
<400> 20 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 
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Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thi 
IOC ' 105 11C 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 ~ 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

' Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
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43£ 44C 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 S25 

Ala His Pro lie Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val. Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

. Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 " 775 780 
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Leu Gin Val His Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
78S 790 795 80C 



Glu Glu val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 * 810 ' 815 



Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 



Ser Ala Lys Gly His His His His His His 
835 840 



<210> 



21 



<211> 



30 



<212> 



DNA 



<213> 



synthetic 



<400> 



21 



caggaggagc tcgttggcga cctggaggag 



30 



<210> 22 

<211> 2526 

<212> DNA 

<213> Thermus thermophilus 

<400> 22 



atgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg gcctcaccac gagccggggc 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc tcaaggccct 


gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc cctccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg acttcccccg gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa aggaggggta 


cgaggtgcgc 


420 


„ atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg acaacctccc 


cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta gggccttcct 


ggagaggctg 


B40 
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gage teggea 


gect cctcca 


egagttegge 


ct cctggagg 


cccccgcccc 


cctggaggag 


900 




gccccctggc 


ccccgccgga 


aggggectte 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 




atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg 


gccgcgtgca 


ccgggcagca 


X020 


j 


gaccccttgg 


eggggctaaa 


ggacctcaag 


gaggtceggg 


gcctcctcgc 


caaggacctc 


1080 




geegtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


i 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg 


etaegggggg 


1200 




gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 




cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct 


accacgaggt 


ggaaaagece 


1320 




ctctcccggg 


tcctggccca 


tatggaggee 


aceggggtae 


ggcgggacgt 


ggcctacctt 


1380 




caggcccttt 


ccctggagct 


tgeggaggag 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 


i 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 




gagcttaggc 


ttcccgcctt 


ggggaagacg 


caaaagacag 


gcaagcgctc 


caccagcgcc 


1560 




gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 




gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 




acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ecaeggggag 


gcttagtagc 


1740 




tccgacccca 


acctgeagaa 


catccccgtc 


cgcaccccct 


tgggecagag 


gatccgccgg 


1800 


1 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg 


gtggccctgg 


actatageca 


gatagagctc 


1860 




cgcgtcctcg 


cccacctctc 


eggggacgaa 


aacctgatca 


gggtcttcca 


ggaggggaag 


1920 




gacatccaca 


cccagaccgc 


aagctggatg 


ttcggcgtcc 


ccccggaggc 


cgtggacccc 


1980 




ctgatgcgcc 


gggcggccaa 


gaeggtgaac 


ttcggcgtcc 


tetaeggcat 


gtccgcccat 


2040 




aggctctccc 


aggagcttgc 


catcccctac 


gaggaggegg 


tggectttat 


agagegctae 


2100 




ttccaaagct 


tccccaaggt 


gcgggcctgg 


atagaaaaga 


ccctggagga 


ggggaggaag 


2160 




eggggctacg 


tggaaaccct 


etteggaaga 


aggegctacg 


tgcccgacct 


caacgcccgg 


2220 




gtgaagagcg 


tcagggaggc 


cgcggagcgc 


atggccttca 


acatgcccgt 


ccagggcacc 


2280 




gccgccgacc 


tcatgaagct 


cgccatggtg 


aagctcttcc 


cccgcctccg 


ggagatgggg 


2340 




gcccgcatgc 


tcctccaggt 


cgccaacgag 


ctcctcctgg 


aggcccccca 


agcgcgggcc 


2400 




gaggaggtgg 


eggctttgge 


caaggaggee 


atggagaagg 


cctatcccct 


cgccgtgccc 


2460 




ctggaggtgg 


aggtggggat 


gggggaggac 


tggctttccg 


ccaagggtca 


ccaccaccac 


2520 




caccac 












2526 



<210> 23 
<211> 842 
<212> PRT 
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<213> Thermus thermophilics 
<400> 23 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 3Q 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 5S 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg- Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 " 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 
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Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Ara Glv Gly Ara Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

' Pro Leu Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp lie His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
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660 665 67C 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie 
675 680 ~ 685 

Pro Tyr Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

pro Lys Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys 
70S 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 

<210> 24 
<211> 30 
<212> DNA 
<213> synthetic 
<400> 24 

caggaggagc tcgttggcga cctggaggag 3 0 

<210> 25 
'<211> 2526 
<212> DNA 

<213> Thermus thermophilus 
<400> 25 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 
ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120 
gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 
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gggtacaagc 


ccg tcttcgt 


ggt ct t tgac 


gccaaggccc 


cctcct tccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg 


gccgcgtgca 


ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg 


gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac 


ggcgggacgt 


ggcctacctt 


1380 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 


gagcttaggc 


ttcccgcctt 


gaagaagacg 


aagaagacag 


gcaagcgctc 


caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgfcgg 


agaagatcct 


ccagcaccgg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag 


gcttagtagc 


1740 


tccgacccca 


acctgcagaa 


catccccgtc 


cgcaccccct 


tgggccagag 


gatccgccgg 


1800 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg 


gtggccctgg 


actatagcca 


gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 


cggggacgaa 


aacctgatca 


gggtcttcca 


ggaggggaag 


1920 


gacatccaca 


cccagaccgc 


aagctggatg 


ttcggcgtcc 


ccccggaggc 


cgtggacccc 


1980 


ctgatgcgcc 


gggcggccaa 


gacggtgaac 


ttcggcgtcc 


tctacggcat 


gtccgcccat 


2040 


aggctctccc 


aggagcttgc 


catcccctac 


gaggaggcgg 


tggcctttat 


agagcgctac 


2100 
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4 

-3 



tt ccaaagci 


tccccaaggt 


gcgggcctgg 


atagaaaaga 


ccctggagga 


ygggaggaac 


216C 


cggggccacg 


tggaaaccct 


cttcggaaga 


aggcgctacg 


tgcccgacct 


caacgcccgg 


2220 


gtgaagagcg 


tcagggaggc 


cgcggagcgc 


atggccttca 


acatgcccgt 


ccagggcacc 


2280 


gccgccgacc 


tcatgaagct cgccatggtg aagctcttcc 


cccgcctccg ggagatgggg 


2340 


gcccgcatgc 


tcctccaggt 


cgccaacgag 


ctcctcctgg 


aggcccccca 


agcgcgggcc 


2400 


gaggaggtgg 


cggctttggc 


caaggaggcc 


atggagaagg 


cctatcccct 


cgccgtgccc 


2460 


ctggaggtgg 


aggtggggat 


gggggaggac 


tggctttccg 


ccaagggtca 


ccaccaccac 


2520 


caccac 












2526 



<210> 26 

<2ll> 842 

<212> PRT 

<213> Thermus thermophilus 

<400> 26 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 

1 5 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Leu 
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20C 



Lys 



Leu 
210 



Leu 



Lys 



Glu 



Trp 



Gly 
215 



Ser 



Leu Glu 



Asn Leu 
220 



Leu 



Lys 



Asn 



Leu 



Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 " 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 " 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 
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Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pre Arc 
545 550 555 . 56 0 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg, Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 €00 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 . 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 6S0 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thx Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 ' 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 



<210> 27 
<:211> 340 
<212> PRT 
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*213> Pyrococcus furiosuE 
<400> 27 

Met Gly Val Pro lie Gly Glu He lie Pro Arg Lys Glu He Glu Leu 
15 10 15 

Glu Asn Leu Tyr Gly Lys Lys He Ala He Asp Ala Leu Asn Ala He 
20 25 30 

Tyr Gin Phe Leu Ser Thr He Arg Gin Lys Asp Gly Thr Pro Leu Met 
35 40 45 

Asp Ser Lys Gly Arg He Thr Ser His Leu Ser Gly Leu Phe Tyr Arg 
50 55 60 

Thr He Asn Leu Met Glu Ala Gly He Lys Pro Val Tyr Val Phe Asp 
65 70 75 80 

Gly Glu Pro Pro Glu Phe Lys Lys Lys Glu Leu Glu Lys Arg Arg Glu 
85 90 95 

Ala Arg Glu Glu Ala Glu Glu Lys Trp Arg Glu Ala Leu Glu Lys Gly 
100 105 110 

Glu He Glu Glu Ala Arg Lys Tyr Ala Gin Arg Ala Thr Arg Val Asn 
115 120 ~ 125 

Glu Met Leu He Glu Asp Ala Lys Lys Leu Leu Glu Leu Met Gly He 
130 135 140 

Pro He Val Gin Ala Pro Ser Glu Gly Glu Ala Gin Ala Ala Tyr Met 
145 150 ~ 155 160 

Ala Ala Lys Gly Ser Val Tyr Ala Ser Ala ser Gin Asp Tyr Asp Ser 
165 170 175 

Leu Leu Phe Gly Ala Pro Arg Leu Val Arg Asn Leu Thr He Thr Gly 
180 185 190 

Lys Arg Lys Leu Pro Gly Lys Asn Val Tyr Val Glu He Lys Pro Glu 
195 200 205 

Leu He He Leu Glu Glu Val Leu Lys Glu Leu Lys Leu Thr Arg Glu 
210 215 220 

Lys Leu lie Glu Leu Ala He Leu Val Gly Thr Asp Tyr Asn Pro Gly 
225 230 235 240 

Gly He Lys Gly He Gly Leu Lys Lys Ala Leu Glu He Val Arg His 
245 250 255 

Ser Lys Asp Pro Leu Ala Lys Phe Gin Lys Gin Ser Asp Val Asp Leu 
260 265 270 

Tyr Ala He Lys Glu Phe Phe Leu Asn Pro Pro Val Thr Asp Asn Tyr 
275 280 285 

Asn Leu Val Trp Arg Asp Pro Asp Glu Glu Gly He Leu Lys Phe Leu 
290 295 300 

Cys Asp Glu His Asp Phe Ser Glu Glu Arg Val Lys Asn Gly Leu Glu 
305 310 315 320 
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Ara Leu Lys Lys Ala lie Lys Ser Gly Lys Gin Ser Thr Leu Glu £ej 
325 * 330 33b 

Trp Phe Lys Arg 
340 

<210> 28 
<211> 326 
<212> PRT 

<213> Methanococcus j annas chii 
<400> 28 

Met Gly Val Gin Phe Gly Asp Phe lie Pro Lys Asn lie He Ser Phe 
15 10 15 

Glu Asp Leu Lys Gly Lys Lys Val Ala He Asp Gly Met Asn Ala Leu 
20 25 30 

Tyr Gin Phe Leu Thr Ser He Arg Leu Arg Asp Gly Ser Pro Leu Arg 
35 40 45 

Asn Arg Lys Gly Glu He Thr Ser Ala Tyr Asn Gly Val Phe Tyr Lys 
50 55 60 

Thr He His Leu Leu Glu Asn Asp He Thr Pro He Trp Val Phe Asp 
65 70 75 80 

Gly Glu Pro Pro Lys Leu Lys Glu Lys Thr Arg Lys Val Arg Arg Glu 
85 90 95 

Met Lys Glu Lys Ala Glu Leu Lys Met Lys Glu Ala He Lys Lys Glu 
100 105 110 

Asp Phe Glu Glu Ala Ala Lys Tyr Ala Lys Arg Val Ser Tyr Leu Thr 
115 120 125 

Pro Lys Met Val Glu Asn Cys Lys Tyr Leu Leu Ser Leu Met Gly He 
130 135 140 

Pro Tyr Val Glu Ala Pro Ser Glu Gly Glu Ala Gin Ala Ser Tyr Met 
145 150 155 160 

Ala Lys Lys Gly Asp Val Trp Ala Val Val Ser Gin Asp Tyr Asp Ala 
165 170 175 

Leu Leu Tyr Gly Ala Pro Arg Val Val Arg Asn Leu Thr Thr Thr Lys 
180 ~ 185 190 

"Glu Met Pro Glu Leu lie Glu Leu Asn Glu Val Leu Glu Asp Leu Arg 
195 200 205 

He Ser Leu Asp Asp Leu He Asp He Ala He Phe Met Gly Thr Asp 
210 215 220 

Tyr Asn Pro Gly Gly Val Lys Gly He Gly Phe Lys Arg Ala Tyr Glu 
225 230 235 240 

Leu Val Arg Ser Gly Val Ala Lys Asp Val Leu Lys Lys Glu Val Glu 
245 250 255 

Tyr Tyr Asp Glu He Lys Arg He Phe Lys Glu Pro Lys Val Thr Asp 
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26C 265 270 

Asn Tyr Ser Leu Ser Leu Lys Leu Pro Asp Lys Glu Gly lie lie Lys 
275 280 285 

Phe Leu Val Asp Glu Asn Asp Phe Asn Tyr Asp Arg Val Lys Lys His 
290 295 300 

Val Asp Lys Leu Tyr Asn Leu lie Ala Asn Lys Thr Lys Gin Lys Thr 
305 310 315 320 

Leu Asp Ala Trp Phe Lys 
325 



<210> 


29 




<211> 


328 




<212> 


PRT 




<213> 


Me t hanoba c t e r ium 


thermoautotrophicum 


<400> 


29 




Met Gly Val Lys Leu Arg 
1 5 


Asp Val Val Ser Pro 
10 



15 

Glu Asp Leu Arg Gly Arg Thr Val Ala Val Asp Ala Ala Asn Thr Leu 
20 25 30 

Tyr Gin Phe Leu Ser Ser He Arg Gin Arg Asp Gly Thr Pro Leu Met 
35 40 45 

Asp Ser Arg Gly Arg Val Thr Ser His Leu Ser Gly He Leu Tyr Arg 
50 55 60 

Thr Ala Ala Val Met Glu Arg Glu He Arg Val He Tyr Val Phe Asp 
65 70 75 80 

Gly Arg Ser His His Leu Lys Gly Glu Thr Val Ser Arg Arg Ala Asp 
85 90 95 

lie Arg Lys Lys Ser Glu Val Glu Trp Lys Arg Ala Leu Glu Glu Gly 
100 105 HO 

Asp He Asp Arg Ala Arg Lys Tyr Ala Val Arg Ser Ser Arg Met Ser 
115 . 120 125 

Ser Glu He Leu Glu Ser Ser Lys Arg Leu Leu Glu Leu Leu Gly He 
130 135 140 

Pro Tyr Val Gin Ala Pro Gly Glu Gly Glu Ala Gin Ala Ser Tyr Met 
145 150 155 160 

Val Lys Met Gly Asp Ala Trp Ala Val Ala Ser Gin Asp Tyr Asp Cys 
165 170 175 

Leu Leu Phe Gly Ala Pro Arg Val Val Arg Lys Val Thr Leu Ser Gly 
180 185 190 

Lys Leu Glu Asp Pro His He He Glu Leu Glu Ser Thr Leu Arg Ala 
195 200 205 
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Leu Ser He Ser His Thr Gin Leu Val Asp Met Ala Leu Leu Vaj Gly 
210 215 220 

Thr Asp Phe Asn Glu Gly Val Lys Gly Tyr Gly Ala Arg Arg Gly Leu 
225 230 235 240 

Lys Leu He Arg Glu Lys Gly Asp lie Phe Lys Val He Arg Asp Leu 
245 250 255 

Glu Ala Asp He Gly Gly Asp Pro Gin Val Leu Arg Arg He Phe Leu 
260 265 270 

Glu Pro Glu Val Ser Glu Asp Tyr Glu He Arg Trp Arg Lys Pro Asp 
275 280 285 

Val Glu Gly Val He Glu Phe Leu Cys Thr Glu His Gly Phe Ser Glu 
290 295 300 

Asp Arg Val Arg Asp Ala Leu Lys Lys Phe Glu Gly Ala Ser Ser Thr 
305 310 315 320 

Gin Lys Ser Leu Glu Asp Trp Phe 
325 



<210> 


30 


<211> 


336 


<212> 


PRT 


<213> 


Afu 


<400> 


30 



Met Gly Ala Asp He Gly Asp Leu Phe Glu Arg Glu Glu Val Glu Leu 
15 10 15 

Glu Tyr Phe Ser Gly Lys Lys He Ala Val Asp Ala Phe Asn Thr Leu 
20 25 30 

Tyr Gin Phe He Ser He He Arg Gin Pro Asp Gly Thr Pro Leu Lys 
35 40 45 

Asp Ser Gin Gly Arg He Thr Ser His L.eu Ser Gly He Leu Tyr Arg 
50 55 60 

Val Ser Asn Met Val Glu Val Gly He Arg Pro Val Phe Val Phe Asp 
. 65 70 75 80 

Gly Glu Pro Pro Glu Phe Lys Lys Ala Glu He Glu Glu Arg Lys Lys 
85 90 95 

Arg Arg Ala Glu Ala Glu Glu Met Trp He Ala Ala Leu Gin Ala Gly 
100 105 110 

Asp Lys Asp Ala Lys Lys Tyr Ala Gin Ala Ala Gly Arg Val Asp Glu 
115 120 125 

Tyr lie Val Asp Ser Ala Lys Thr Leu Leu Ser Tyr Met Gly He Pro 
130 135 140 
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Phe vsl Asp Ala Pre Ser Glu Gly Glu Ala Gin Ala Ala Tyr Met Ala 
141 15C ISt 16 C 

Ala Lys Gly Asp Val Glu Tyr Thr Gly Ser Gin Asp Tyr Asp Ser Leu 
165 170 175 

Leu Phe Gly Ser Pro Arg Leu Ala Arg Asn Leu Ala lie Thr Gly Lys 
180 185 190 

Arg Lys Leu Pro Gly Lys Asn Val Tyr Val Asp Val Lys Pro Glu lie 
195 200 205 

lie lie Leu Glu Ser Asn Leu Lys Arg Leu Gly Leu Thr Arg Glu Gin 
210 215 " 220 

Leu He Asp He Ala He Leu Val Gly Thr Asp Tyr Asn Glu Gly Val 
225 230 235 240 

Lys Gly Val Gly Val Lys Lys Ala Leu Asn Tyr He Lys Thr Tyr Gly 
245 250 255 

Asp He Phe Arg Ala Leu Lys Ala Leu Lys Val Asn He Asp His Val 
260 265 270 

Glu Glu He Arg Asn Phe Phe Leu Asn Pro Pro Val Thr Asp Asp Tyr 
275 280 285 

Arg He Glu Phe Arg Glu Pro Asp Phe Glu Lys Ala He Glu Phe Leu 
290 295 300 

Cys Glu Glu His Asp Phe Ser Arg Glu Arg Val Glu Lys Ala Leu Glu 
305 310 315 320 

Lys Leu Lys Ala Leu Lys Ser Thr Gin Ala Thr Leu Glu Arg Trp Phe 
325 330 335 



<210> 31 
<211> 27 
<212> DNA 
<213> synthetic 

<220><221> misc_f eature<222> (17) . . (18) <223> n = degeneracy 
<220><221> misc_f eature<222> (27) . . (28) <223> n » degeneracy 
<400> 31 

atctctagca ctgctgtntt ygayggn 



27 



<210> 32 
<211> 31 
<212> DNA 
<2l3> synthetic 

<220x221> misc_f eature<222> (21) . . (22) <223 > n = degeneracy 
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<220><22l> mascjeature<222> ( 2 7 ) . . ( 2 8 ) <22 3 > n = degeneracy 
<400> 32 

gatctctagc actgctgarg gngargcnca r 31 

<210> 33 
<211> 28 
<212> DNA 
<213> synthetic 
<400> 33 

gatctctagc actgctcarg aytaygay 2 8 

<210> 34 
<211> 31 
<212> DNA 
<213> synthetic 

<220><221> misc_feature<222> (19) . . (20) <223 > n « degeneracy 
<220><221> misc_f eature<222> (25) . . (26) <223> n * degeneracy 
<400> 34 

cttaaggtag gactacytgn gcytcnccyt c 31 

<210> 35 
<211> 30 
<212> DNA 
<213> synthetic 
<400> 35 

ttaaggtagg actacytcrt aytcytgrct 3 0 

<210> 36 
<211> 30 
<212> DNA 
<213> synthetic 

<220><221> misc_feature<222> (27 ) . , (28 ) <223> n = degeneracy 
<400> 36 

ttaaggtagg actacytcrt aytcytgnga 3 0 
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<210> 3*7 
<211> 30 
<212> DNA 
<213> synthetic 

<220><22l> misc__feature<222> {24 ) . . (25) <223> n * degeneracy 
<220><221> misc__feature<222> (27) . . (28 ) <223> n = degeneracy 
<400> 37 

ttaaggtagg actacrttrw artcngtncc 3 0 

<210> 38 
<211> 16 
<212> DNA 
<213> synthetic 
<400> 38 

gatctctagc actgct 16 

<210> 39 
<211> 17 
<212> DNA 
<213> synthetic 
<400> 39 

ccttaaggta ggactac 17 

<210> 40 
<211> 27 
<212> DNA 
<213> synthetic 
* <400> 40 

tatcgcagcg atccacttct cctctgc 27 

<210> 41 

<211> 27 

<212> DNA 

<213> synthetic 
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<400> 41 

cttaaacggc aacctgagaa ggcttgc 2"/ 

<210> 42 
<21t> 28 
<212> DNA 
<213> synthetic 
<400> 42 

ctatctcctt ctgcttgaaa acaggagg 2 8 

<210> 43 
<211> 27 
<212> DNA 
<213> synthetic 
<400> 43 

acaagggaac agctcgtcga tatcgcg 27 

<210> 44 
<211> 32 
<212> DNA 
<213> synthetic 
<r400> 44 

taacgaattc ggtgcagaca taggcgaact ac 32 

<210> 45 
<211> 33 
<212> DNA 
-<213> synthetic 
<400> 45 

cggtgtcgac tcaggaaaac cacctctcaa gcg 3 3 

<210> 46 

<211> 37 

<212> DNA 

<213> synthetic 
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<400> 4fc 

cacaggaaac agaccatggg tgcagacata ggcgaac 37 

<210> 47 

<211> 1017 

<212> DNA 

<213> Ave 

<400> 47 



atgggtgcag 


acataggcga 


actactcgag 


agagaagaag 


ttgaacttga 


gtacttctcc 


60 


gggagaaaaa 


tagctattga 


tgcttttaac 


actctttacc 


agttcatatc 


tatcataagg 


120 


caacctgacg 


gcactccttt 


gaaggattct 


cagggtagaa 


tgacctcaca 


cctctccggc 


180 


atcctgtacc 


gcgtgtcaaa 


catgatcgag 


gttggaatga 


gacccatttt 


cgttttcgat 


240 


ggtgagcctc 


ctgttttcaa 


gcagaaggag 


atagaggaac 


gaaaggaaag 


aagagctgaa 


300 


gcagaggaga 


agtggatcgc 


tgcgatagag 


agaggagaga 


agtacgcaaa 


gaagtacgct 


360 


caggcagcgg 


cgagggttga 


tgaatacatc 


gtcgagtcgt 


caaagaagct 


gcttgagtat 


420 


atgggagttc 


catgggttca 


ggcgccgagt 


gagggagagg 


cacaggctgc 


atacatggca 


480 


gcgaagggcg 


atgtagattt 


tactggctcg 


caggattacg 


actcgcttct 


cttcggcagc 


540 


ccaaagcttg 


caagaaatct 


cgcgattact 


ggaaagagga 


agctgcccgg aaagaatgtt 


600 


tacgttgagg 


tcaaaccaga 


gataatagac 


ttaaacggca 


acctgagaag gcttggaata 


660 


acaagggaac 


agctcgtcga 


tatcgcgttg 


ctcgtgggaa 


cggactacaa 


cgaaggagtg 


720 


aagggcgttg 


gggtcaagaa 


ggcctacaag 


tacataaaaa 


cctacggaga 


tgttttcaaa 


780 


gctctcaagg 


ccttaaaggt 


agagcaggag 


aacatagagg 


agataagaaa 


cttcttcctg 


840 


aacccgcctg 


ttacgaacaa 


ctacagcctc 


cacttcggaa 


agccagacga 


tgagaagatt 


900 


atcgagttcc 


tgtgtgaaga 


gcacgacttt 


agcaaggata 


gggtagagaa 


ggccgttgag 


960 


aagctgaaag 


caggaatgca 


agcctcgcaa 


tcaacgcttg 


agaggtggtt 


ttcctga 


1017 



<210> 48 
<211> 337 
<212> PRT 
<213> Ave 
<400> 48 

Met Gly Ala Asp He Gly Glu Leu Leu Glu Arg Glu Glu Val Glu Leu 
15 10 15 

Glu Tyr Phe Ser Gly Arg Lys He Ala He Asp Ala Phe Asn Thr Leu 
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20 25 3C 

Tyr Gin Phe lie Ser lie He Arg Gin Pro Asp Gly Thr Pre Leu Lys 
3S 40 45 

Asp Ser Gin Gly Arg Met Thr Ser His Leu Ser Gly He Leu Tyr Arg 
50 55 60 

Val Ser Asn Met lie Glu Val Gly Met Arg Pro He Phe Val Phe Asp 
65 70 75 80 

Gly Glu Pro Pro Val Phe Lys Gin Lys Glu He Glu Glu Arg Lys Glu 
85 ' 90 95 

Arg Arg Ala Glu Ala Glu Glu Lys Trp He Ala Ala He Glu Arg Gly 
100 105 HO 

Glu Lys Tyr Ala Lys Lys Tyr Ala Gin Ala Ala Ala Arg Val Asp Glu 
115 120 125 

Tyr He Val Glu Ser Ser Lys Lys Leu Leu Glu Tyr Met Gly Val Pro 
130 135 140 

Trp Val Gin Ala Pro Ser Glu Gly Glu Ala Gin Ala Ala Tyr Met Ala 
145 150 155 160 

Ala Lys Gly Asp Val Asp Phe Thr Gly Ser Gin Asp Tyr Asp Ser Leu 
165 170 175 

Leu Phe Gly Ser Pro Lys Leu Ala Arg Asn Leu Ala He Thr Gly Lys 
180 185 190 

Arg Lys Leu Pro Gly Lys Asn Val Tyr Val Glu Val Lys Pro Glu He 
195 200 205 

lie Asp Leu Asn Gly Asn Leu Arg Arg Leu Gly He Thr Arg Glu Gin 
210 21S 220 

Leu Val Asp He Ala Leu Leu Val Gly Thr Asp Tyr Asn Glu Gly Val 
225 * 230 235 240 

Lys Gly Val Gly Val Lys Lys Ala Tyr Lys Tyr He Lys Thr Tyr Gly 
245 250 255 

Asp val Phe Lys Ala Leu Lys Ala Leu Lys Val Glu Gin Glu Asn He 
260 265 270 

Glu Glu He Arg Asn Phe Phe Leu Asn Pro Pro Val Thr Asn Asn Tyr 
275 280 285 

- Ser Leu His Phe Gly Lys Pro Asp Asp Glu Lys He He Glu Phe Leu 
290 295 300 

Cys Glu Glu His Asp Phe Ser Lys Asp Arg Val Glu Lys Ala Val Glu 
305 310 315 320 

Lys Leu Lys Ala Gly Met Gin Ala Ser Gin ser Thr Leu Glu Arg Trp 
325 330 335 

Phe Ser 
<210> 49 
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5 3 



<2l2> DNA 



<213> synthetic 



<400> 49 



cccgtctcgc tggtgaaaag aaaaaccacc ctggcgccca atacgcaaac cgc 



53 



<210> 50 
<211> 26 
<212> DNA 
<213> synthetic 

<220x221> misc_feature<222> (22) . . {23 ) <223> misc. feature 
<220x221> misc_feature<222> (1) . . (2) <223> misc. feature 
<220x221> misc_feature<222> (2) . . <3 ) <223> misc. feature 
<400> 50 

tntnccagag cctaatttgc cagtna 26 

<210> 51 
<211> 26 
<212> DNA 
<213> synthetic 

<220x221> misc_feature<222> (1) . . (2) <223> misc. feature 
<220x221> misc_feature<222> (2) . . (3) <223> Misc. Feature 
<220x221> misc_feature<222> (22) . . (23 ) <223> Misc. Feature 
<400> 51 

tntnccagag cctaatttgc cagtna 26 

<210> 52 
<211> 24 
<212> DNA 
<213> synthetic 

<220x221> misc_feature<222> (22 ) . . (23 ) <223> misc. feature 
<400> 52 

ttccagagcc taatttgcca gtna 24 
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<210> 53 
<211> 24 
<212> DNA 
<213> synthetic 

<220><221> misc_f eature<222> (22) . . (23) <223> misc. feature 
<400> 53 

ttccagagcc taatttgcca gtna 24 

<210> 54 
<211> 25 
<212> DNA 
<213> synthetic 
<400> 54 

cttaccaacg ctaacgagcg tcttg 25 

<210> 55 
<211> 14 
<212> DNA 
<213> synthetic 
<400> 55 

gctcccgcag acac 14 

<210> 56 
<211> 15 
<212> DNA 
<213> synthetic 

<220><221> misc_feature<222> (1) . . (2) <223 > raise, feature 
<400> 56 

tntacgccac cagct 15 

<210> 57 

<211> 12 

<212> DNA 

<213> synthetic 
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<400> 5^ 

cgctgtctcg ct 12 

<210> 58 
<211> 19 
<212> DNA 
<213> synthetic 
<400> 58 

gctcaaggca ctcttgccc 19 

<210> 59 

<211> 63 

<212> DNA 

<213> synthetic 

<400> 59 

atgactgaat ataaacttgt ggtagttgga gctggtggcg taggcaagag tgccttgacg 6 0 

ata 6 3 

c210> 60 
<211> 45 
<2X2> DNA 
<213> synthetic 
<400> 60 

ttttttttta attaggctct ggaaagacgc tcgtgaaacg agcgt 45 

<210> 61 
<211> 14 
<212> DNA 
<213> synthetic 
<400> 61 

cttcggagtt tggg 14 

<210> 62 
<211> 16 
<212> DNA 
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<213> synthetic 

<220><221> misc_f eature<222> ( 1 ) . . {2 ) <223 > misc. feature 
<400> 62 

ancttcggag tttggg 16 

<210> 63 
<211> 16 
<212> DNA 
<213> synthetic 

<220><221> misc_feature<222> (1) . . (2) <223> misc. feature 
<400> 63 

cncttcggag tttggg 16 

<210> 64 
<211> 16 
<212> DNA 
<213> synthetic 

*220x221> misc__feature<222> (1) ♦ . (2) <223> misc. feature 
<400> 64 

gncttcggag tttggg 16 

<210> 65 
<211> 16 
<212> DNA 
<213> synthetic 

<220><221> misc_feature<222> (1) . . <2)<223> misc. feature 
<400> 65 

tncttcggag tttggg 16 

<r210> 66 
<211> 25 
<212> DNA 
<213> synthetic 
<400> 66 

gggttgtgga gtgagtgttc aagta 25 
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<210> 


67 


<211> 


27 


<212> 


DNA 


<213> 


synthetic 


<400> 


67 



ccatcctaat acgactcact atagggc 27 

<210> 68 
<211> 21 
<212> DNA 
<213> synthetic 
<400> 68 

ctcatacagt tacttgtctt c 21 

<210> 69 

<211> 489 

<212> RNA 

<213> Homo sapiens 

<400> 69 

gaacucacua uagggcucga gcggccgccc gggcaggucc gccaccaaaa ugcagauuuu 60 
cgugaaaacc cuuacgggga agaccaucac ccucgagguu gaacccucgg auacgauaga 12 0 
aaauguaaag gccaagaucc aggauaagga aggaauuccu ccugaucagc agagacugau 18 0 
cuuugcuggc aagcagcugg aagauggacg uacuuugucu gacuacaaua uucaaaagga 240 
gucuacucuu caucuugugu ugagacuucg ugguggugcu aagaaaagga agaagaaguc 3 00 
uuacaccacu cccaagaaga auaagcacaa gagaaagaag guuaagcugg cuguccugaa 360 
auauuauaag guggaugaga auggcaaaau uagucgccuu cgucgagagu gcccuucuga 42 0 
" ugaauguggu gcuggggugu uuauggcaag ucacuuugac agacauuauu guggcaaaug 480 
uugucugac 48 9 

<210> 70 

<211> 52 

<212> DNA 

<213> synthetic 
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<400> 70 

ggaacacgac tcactatagg gasagtctct gccgcccttc tgtgcctgct gc 5; 

<210> 71 
<211> 52 
<2X2> DNA 
<213> synthetic 
<400> 71 

ggaatacgac tcactatagg gaaagtctct gccgcccttc tgtgcctgct gc 52 

<210> 72 
<211> 647 
<212> RNA 
<213> synthetic 
<400> 72 

gggaaagucu cugccgcccu ucugugccug cugcucauag cagccaccuu cauuccccaa 6 0 

gggcucgcuc agccagaugc aaucaaugcc ccagucaccu gcuguuauaa cuucaccaau 120 

aggaagaucu cagugcagag gcucgcgagc uauagaagaa ucaccagcag caaguguccc 180 

aaagaagcug ugaucuucaa gaccauugug gccaaggaga ucugugcuga ccccaagcag 24 0 

aaguggguuc aggauuccau ggaccaccug gacaagcaaa cccaaacucc gaagacuuga 300 

acacucacuc cacaacccaa gaaucugcag cuaacuuauu uuccccuagc uuuccccaga 3 60 

cacccuguuu uauuuuauua uaaugaauuu uguuuguuga ugugaaacau uaugccuuaa 42 0 

guaauguuaa uucuuauuua aguuauugau guuuuaaguu uaucuuucau gguacuagug 4 80 

uuuuuuagau acagagacuu ggggaaauug cuuuuccucu ugaaccacag uucuaccccu 54 0 

gggauguuuu gagggucuuu gcaagaauca uuaauacaaa gaauuuuuuu uaacauucca 6 00 

augcauugcu aaaauauuau uguggaaaug aauauuuugu aacuauu 64 7 

<210> 73 
<211> 16 
<212> DNA 
<213> synthetic 
<400> 73 

ttcttcggag tttggg 16 
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< 2 1 0 : 7 4 
<211> 26 
<212> DNA 
<213> synthetic 
<400> 74 

ccgtcacgcc tccttcggag tttggg 26 

<2X0> 75 
<211> 24 
<212> DNA 
<213> synthetic 
<400> 75 

aacccaaact ccgaaggagg cgtg 24 

<210> 76 
<211> 29 
<212> DNA 
<213> synthetic 
<400> 76 

gcgcagtgag aatgaggagg cgtgacggt 2 9 

<210> 77 
<211> 17 
<212> DNA 
<213> synthetic 

<220><221> misc_feature<222> ( 1) . . (2 ) <223 > misc. .feature 
<400> 77 

cnctcattct cagtgcg 17 

<210> 78 

<211> 30 

<212> DNA 

<213> synthetic 

<400> 78 
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